ggenealogy: An R Package for Visualizing Genealogical Data

Lindsay Rutter, Susan VanderPlas, Dianne Cook, Michelle A. Graham

Main Article Content

Abstract

This paper introduces ggenealogy (Rutter, Vanderplas, and Cook 2019), a developing R software package that provides tools for searching through genealogical data, generating basic statistics on their graphical structures using parent and child connections, parsing and performing calculations on branches of interest, and displaying the results. It is possible to draw the genealogy in relation to variables related to the nodes, and to determine and display the shortest path distances between the nodes. Production of pairwise distance matrices and genealogical diagrams constrained on generation are also available in the visualization toolkit. The tools are being tested on a dataset with milestone cultivars of soybean varieties (Hymowitz, Newell, and Carmer 1977) as well as on a web-based database of the academic genealogy of mathematicians (North Dakota State University and American Mathematical Society 2010). The latest stable package version is available in source and binary form on the Comprehensive R Archive Network (CRAN).

Article Details

Article Sidebar

References

Coster A (2013). pedigree: Pedigree Functions. R package version 0.4, URL https://CRAN.R-project.org/package=pedigree.

Csardi G, Nepusz T (2006). The igraph Software Package for Complex Network Research. InterJournal, Complex Systems, 1695. URL http://igraph.sf.net.

Gansner ER, North SC (2000). An Open Graph Visualization System and its Applications to Software Engineering. Software - Practice and Experience, 30(11), 1203-1233.

Hymowitz T, Newell C, Carmer S (1977). Pedigrees of Soybean Cultivars Released in the United States and Canada. International Soybean Series, College of Agriculture, University

of Illinois at Urbana-Champaign, Urbana, IL.

North Dakota State University and American Mathematical Society (2010). The Mathematics Genealogy Project. Archived Web Site. Retrieved from the Library of Congress, Accessed on March 6, 2015, URL http://www.genealogy.math.ndsu.nodak.edu.

PostgreSQL (2016). URL http://www.postgresql.org/.

RStudio, Inc (2014). shiny: Web Application Framework for R. R package version 0.10.2.1, URL http://CRAN.R-project.org/package=shiny.

Rutter L, Vanderplas S, Cook D (2015). ggenealogy: Visualization Tools for Genealogical Data. R package version 0.1.0, URL https://CRAN.R-project.org/package=ggenealogy.

Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T (2003). Cytoscape: a Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Research, 13(11), 2498-2504.

Sievert C, Parmer C, Hocking T, Chamberlain S, Ram K, Corvellec M, Despouy P (2016). plotly: Create Interactive Web Graphics via 'plotly.js'. R package version 3.4.13, URL https://CRAN.R-project.org/package=plotly.

Therneau T, Daniel S, Sinnwell J, Atkinson E (2015). kinship2: Pedigree Functions. R package version 1.6.4, URL https://CRAN.R-project.org/package=kinship2.

Wickham H (2009). ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag, New York, NY. URL http://had.co.nz/ggplot2/book.

Wickham H, Francois R (2015). dplyr: A Grammar of Data Manipulation. R package version 0.4.3, URL https://CRAN.R-project.org/package=dplyr.