Skip to main content

ParBaum: A Fast Program for Phylogenetic Tree Inference with Maximum Likelihood

  • Conference paper
Book cover High Performance Computing in Science and Engineering, Garching 2004

Abstract

Inference of large phylogenetic trees using elaborate statistical models is computationally extremely intensive. Thus, progress is primarily achieved via algorithmic innovation rather than by brute-force allocation of all available computational resources. We present simple heuristics which yield accurate trees for synthetic (simulated) as well as real data and improve execution time compared to the currently fastest programs. The new heuristics are implemented in a sequential program (RAxML) which is available as open source code. Furthermore, we present a non-deterministic parallel version of our algorithm which in some cases yielded super-linear speedups for computations with 1000 organisms. We compare sequential RAxML performance with the currently fastest and most accurate programs for phylogenetic tree inference based on statistical methods using 50 synthetic alignments and 9 real-world alignments comprising up to 1000 sequences. RAxML outperforms those programs for real-world data in terms of speed and final likelihood values.

This work is sponsored under the project ID ParBaum, within the framework of the “Competence Network for Technical, Scientific High Performance Computing in Bavaria” (KONWIHR)

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Felsenstein, J.: Evolutionary Trees from DNA Sequences: A Maximum Likelihood Approach. In: J. Mol. Evol., 17:368–376, 1981.

    Article  Google Scholar 

  2. Guindon, S., and Gascuel, O.: A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood. In: Syst. Biol., 52(5):696–704, 2003.

    Article  Google Scholar 

  3. Holder, M.T., and Lewis, P.O.: Phylogeny Estimation: Traditional and Bayesian Approaches. In: Nat. Rev. Gen., 4:275–284, 2003.

    Article  Google Scholar 

  4. Huelsenbeck, J.P., and Ronquist, F.: MRBAYES: Bayesian inference of phylogenetic trees. In: Bioinf., 17(8):754–5, 2001.

    Article  Google Scholar 

  5. Huelsenbeck, J.P., et al.: Potential Applications and Pitfalls of Bayesian Inference of Phylogeny. In: Syst. Biol., 51(5):673–688, 2002.

    Article  Google Scholar 

  6. Ludwig, W. et al.: ARB: A Software Environment for Sequence Data. In: Nucl. Acids Res., in press, 2003.

    Google Scholar 

  7. Olsen, G., et al.: fastdnaml: A Tool for Construction of Phylogenetic Trees of DNA Sequences using Maximum Likelihood. In: Comput. Appl. Biosci., 10:41–48, 1994.

    Google Scholar 

  8. PAML Manual: superconducting bcr.musc.edu/manuals, visited Nov 2003.

    Google Scholar 

  9. PAUP: superconducting paup.csit.fsu.edu, visited May 2003.

    Google Scholar 

  10. PHYLIP: superconducting evolution.genetics.washington.edu, visited Nov 2003.

    Google Scholar 

  11. RRZE: superconducting www.rrze.uni-erlangen.de, visited Oct 2003.

    Google Scholar 

  12. Stamatakis, A.P., et al: New Fast and Accurate Heuristics for Inference of Large Phylogenetic Trees. In: Proc. of IPDPS2004, to be published.

    Google Scholar 

  13. Stamatakis, A.P., et al: A Fast Program for Maximum Likelihood-based Inference of Large Phylogenetic Trees. In: Proc. of SAC’04, to be published.

    Google Scholar 

  14. Stamatakis, A.P., et al.: Accelerating Parallel Maximum Likelihood-based Phylogenetic Tree Computations using Subtree Equality Vectors. In: Proc. of SC2002, 2002.

    Google Scholar 

  15. Stewart, C. et al.: Parallel Implementation and Performance of fastdnaml-a Program for Maximum Likelihood Phylogenetic Inference. In: Proc. of SC2001, 2001.

    Google Scholar 

  16. Strimmer, K., Haeseler, A.v.: Quartet Puzzling: A Maximum-Likelihood Method for Reconstructing Tree Topologies. In: Mol. Biol. Evol., 13:964–969, 1996.

    Google Scholar 

  17. Williams, T.L., Moret, B.M.E.: An Investigation of Phylogenetic Likelihood Methods. In: Proc. of BIBE’03, 2003.

    Google Scholar 

  18. Tuffley, C, Steel, M.: Links between Maximum Likelihood and Maximum Parsimony under a Simple Sodel of Site Substitution. In: Bull. Math. Biol., 59(3):581–607, 1997.

    Article  MATH  Google Scholar 

  19. Wolf, M.J., et al.: TrExML: A Maximum Likelihood Program for Extensive Tree-space Exploration. In: Bioinf., 16(4):383–394, 2000.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Stamatakis, A.P., Ludwig, T., Meier, H. (2005). ParBaum: A Fast Program for Phylogenetic Tree Inference with Maximum Likelihood. In: Bode, A., Durst, F. (eds) High Performance Computing in Science and Engineering, Garching 2004. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-28555-5_24

Download citation

Publish with us

Policies and ethics