Skip to main content

Efficient Computation of Popular Phylogenetic Tree Measures

  • Conference paper
Book cover Algorithms in Bioinformatics (WABI 2012)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 7534))

Included in the following conference series:

Abstract

Given a phylogenetic tree \(\mathcal{T}\) of n nodes, and a sample R of its tips (leaf nodes) a very common problem in ecological and evolutionary research is to evaluate a distance measure for the elements in R. Two of the most common measures of this kind are the Mean Pairwise Distance (\(\ensuremath{\mathrm{MPD}} \)) and the Phylogenetic Diversity (\(\ensuremath{\mathrm{PD}} \)). In many applications, it is often necessary to compute the expectation and standard deviation of one of these measures over all subsets of tips of \(\mathcal{T}\) that have a certain size. Unfortunately, existing methods to calculate the expectation and deviation of these measures are inexact and inefficient.

We present analytical expressions that lead to efficient algorithms for computing the expectation and the standard deviation of the MPD and the PD. More specifically, our main contributions are:

  1. 1

    We present efficient algorithms for computing the expectation and the standard deviation of the MPD exactly, in Θ(n) time.

  2. 2

    We provide a Θ(n) time algorithm for computing approximately the expectation of the PD and a O(n 2) time algorithm for computing approximately the standard deviation of the PD. We also describe the major computational obstacles that hinder the exact calculation of these concepts.

We also describe O(n) time algorithms for evaluating the MPD and PD given a single sample of tips. Having implemented all the presented algorithms, we assess their efficiency experimentally using as a point of reference a standard software package for processing phylogenetic trees.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bininda-Emonds, O.R.P., Cardillo, M., Jones, K.E., MacPhee, R.D.E., Beck, R.M.D., Grenyer, R., Price, S.A., Vos, R.A., Gittleman, J.L., Purvis, A.: The delayed rise of present-day mammals. Nature 446, 507–512 (2007)

    Article  Google Scholar 

  2. Cavendar-Bares, J., Ackerly, D.D., Baum, D., Bazzaz, F.A.: Phylogenetic overdispersion in the assembly of Floridian oak communities. American Naturalist 163, 823–843 (2004)

    Article  Google Scholar 

  3. Faith, D.P.: Conservation evaluation and phylogenetic diversity. Biological Conservation 61, 1–10 (1992)

    Article  Google Scholar 

  4. Faller, B., Pardi, F., Steel, M.: Distribution of phylogenetic diversity under random extinction. Journal of Theoretical Biology 251, 286–296 (2008)

    Article  Google Scholar 

  5. Felsenstein, J.: PHYLIP: Phylogeny inference package, version 3.57c. Distributed by the author, Department of Genetics. Univ. of Washington (1995)

    Google Scholar 

  6. Graham, C.H., Fine, P.V.A.: Phylogenetic beta diversity: linking ecological and evolutionary processes across space and time. Ecology Letters 11, 1265–1277 (2008)

    Article  Google Scholar 

  7. Hartmann, K., Steel, M.: Phylogenetic diversity: From combinatorics to ecology. In: Gascuel, O., Steel, M. (eds.) Reconstructing Evolution: New Mathematical and Computational Approaches. Oxford University Press (2007)

    Google Scholar 

  8. Kembel, S.W., Ackerly, D.D., Blomberg, S.P., Cornwell, W.K., Cowan, P.D., Helmus, M.R., Morlon, H., Webb, C.O.: Documentation for picante R package (2011)

    Google Scholar 

  9. Kissling, W.D., Eiserhardt, W.L., Baker, W.J., Borchsenius, F., Couvreur, T.L.P., Balslev, H., Svenning, J.-C.: Cenozoic imprints on the phylogenetic structure of palm species assemblages worldwide. Proc. National Academy of Sciences 109, 7379–7384 (2012)

    Article  Google Scholar 

  10. R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna (2010)

    Google Scholar 

  11. Steel, M.: Tools to construct and study big trees: A mathematical perspective. In: Hodkinson, T., Parnell, J., Waldren, S. (eds.) Reconstructing the Tree of Life: Taxonomy and Systematics of Species Rich Taxa, pp. 97–112. CRC Press (2007)

    Google Scholar 

  12. Webb, C.O., Ackerly, D.D., McPeek, M.A., Donoghue, M.J.: Phylogenies and community ecology. Annual Review of Ecology and Systematics 33, 475–505 (2002)

    Article  Google Scholar 

  13. Webb, C., Ackerly, D., Kembel, S.: Phylocom Users Manual, version 4.2 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tsirogiannis, C., Sandel, B., Cheliotis, D. (2012). Efficient Computation of Popular Phylogenetic Tree Measures. In: Raphael, B., Tang, J. (eds) Algorithms in Bioinformatics. WABI 2012. Lecture Notes in Computer Science(), vol 7534. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33122-0_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33122-0_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33121-3

  • Online ISBN: 978-3-642-33122-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics