Abstract
Given a phylogenetic tree \(\mathcal{T}\) of n nodes, and a sample R of its tips (leaf nodes) a very common problem in ecological and evolutionary research is to evaluate a distance measure for the elements in R. Two of the most common measures of this kind are the Mean Pairwise Distance (\(\ensuremath{\mathrm{MPD}} \)) and the Phylogenetic Diversity (\(\ensuremath{\mathrm{PD}} \)). In many applications, it is often necessary to compute the expectation and standard deviation of one of these measures over all subsets of tips of \(\mathcal{T}\) that have a certain size. Unfortunately, existing methods to calculate the expectation and deviation of these measures are inexact and inefficient.
We present analytical expressions that lead to efficient algorithms for computing the expectation and the standard deviation of the MPD and the PD. More specifically, our main contributions are:
-
1
We present efficient algorithms for computing the expectation and the standard deviation of the MPD exactly, in Θ(n) time.
-
2
We provide a Θ(n) time algorithm for computing approximately the expectation of the PD and a O(n 2) time algorithm for computing approximately the standard deviation of the PD. We also describe the major computational obstacles that hinder the exact calculation of these concepts.
We also describe O(n) time algorithms for evaluating the MPD and PD given a single sample of tips. Having implemented all the presented algorithms, we assess their efficiency experimentally using as a point of reference a standard software package for processing phylogenetic trees.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bininda-Emonds, O.R.P., Cardillo, M., Jones, K.E., MacPhee, R.D.E., Beck, R.M.D., Grenyer, R., Price, S.A., Vos, R.A., Gittleman, J.L., Purvis, A.: The delayed rise of present-day mammals. Nature 446, 507–512 (2007)
Cavendar-Bares, J., Ackerly, D.D., Baum, D., Bazzaz, F.A.: Phylogenetic overdispersion in the assembly of Floridian oak communities. American Naturalist 163, 823–843 (2004)
Faith, D.P.: Conservation evaluation and phylogenetic diversity. Biological Conservation 61, 1–10 (1992)
Faller, B., Pardi, F., Steel, M.: Distribution of phylogenetic diversity under random extinction. Journal of Theoretical Biology 251, 286–296 (2008)
Felsenstein, J.: PHYLIP: Phylogeny inference package, version 3.57c. Distributed by the author, Department of Genetics. Univ. of Washington (1995)
Graham, C.H., Fine, P.V.A.: Phylogenetic beta diversity: linking ecological and evolutionary processes across space and time. Ecology Letters 11, 1265–1277 (2008)
Hartmann, K., Steel, M.: Phylogenetic diversity: From combinatorics to ecology. In: Gascuel, O., Steel, M. (eds.) Reconstructing Evolution: New Mathematical and Computational Approaches. Oxford University Press (2007)
Kembel, S.W., Ackerly, D.D., Blomberg, S.P., Cornwell, W.K., Cowan, P.D., Helmus, M.R., Morlon, H., Webb, C.O.: Documentation for picante R package (2011)
Kissling, W.D., Eiserhardt, W.L., Baker, W.J., Borchsenius, F., Couvreur, T.L.P., Balslev, H., Svenning, J.-C.: Cenozoic imprints on the phylogenetic structure of palm species assemblages worldwide. Proc. National Academy of Sciences 109, 7379–7384 (2012)
R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna (2010)
Steel, M.: Tools to construct and study big trees: A mathematical perspective. In: Hodkinson, T., Parnell, J., Waldren, S. (eds.) Reconstructing the Tree of Life: Taxonomy and Systematics of Species Rich Taxa, pp. 97–112. CRC Press (2007)
Webb, C.O., Ackerly, D.D., McPeek, M.A., Donoghue, M.J.: Phylogenies and community ecology. Annual Review of Ecology and Systematics 33, 475–505 (2002)
Webb, C., Ackerly, D., Kembel, S.: Phylocom Users Manual, version 4.2 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tsirogiannis, C., Sandel, B., Cheliotis, D. (2012). Efficient Computation of Popular Phylogenetic Tree Measures. In: Raphael, B., Tang, J. (eds) Algorithms in Bioinformatics. WABI 2012. Lecture Notes in Computer Science(), vol 7534. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33122-0_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-33122-0_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33121-3
Online ISBN: 978-3-642-33122-0
eBook Packages: Computer ScienceComputer Science (R0)