Skip to main content

MrBayes for Phylogenetic Inference Using Protein Data on a GPU Cluster

  • Conference paper
  • First Online:
Book cover Algorithms and Architectures for Parallel Processing (ICA3PP 2015)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9530))

  • 1837 Accesses

Abstract

MrBayes is a widely used software for Bayesian phylogenetic inference: we input biological sequence data from various taxonomic groups, and MrBayes returns its estimate of the phylogenetic tree which gave rise to those taxa. This paper presents ta(MC)\(^{3}\), based on its predecessor a(MC)\(^{3}\), which, for protein datasets, improves computational efficiency and overcomes major obstacles in analyzing larger datasets on HPCs with multiple Graphics Processing Units (GPUs). The major improvements are (a) a new task mapping strategy, (b) the use of Kahan summation to resolve non-convergence issues, and (c) the introduction of 64-bit variables. We evaluate ta(MC)\(^{3}\) on real-world protein datasets both on a desktop server and the Tianhe-1A supercomputer. With a single GPU, ta(MC)\(^{3}\) is nearly 90 times faster compared with the serial version of MrBayes, up to around 9 times faster than MrBayes utilizing a GPU via the BEAGLE library, and up to 2.5 times faster than a(MC)\(^{3}\). On larger datasets with 64 nodes (GPUs) on Tianhe-1A, ta(MC)\(^{3}\) is capable of obtaining \(1000+\) speedup vs. serial MrBayes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.top500.org/lists/2010/11/.

References

  1. Altekar, G., Dwarkadas, S., Huelsenbeck, F., Ronquist, J.P.: Parallel metropolis coupled markov chain monte carlo for bayesian phylogenetic inference. Bioinformatics 20, 407–415 (2004)

    Article  Google Scholar 

  2. Bao, J., Xia, J., Zhou, J., Liu, X.G., Wang, G.: Efficient implementation of MrBayes on multi-GPU. Mol. Biol. Evol. 30, 1471–1479 (2013)

    Article  Google Scholar 

  3. Farber, R.: CUDA Application Design and Development. Morgan Kaufmann, San Francisco (2011)

    Google Scholar 

  4. Felsenstein, J.: Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17, 368–376 (1981)

    Article  Google Scholar 

  5. Kahan, W.: Pracniques: further remarks on reducing truncation errors. Commun. ACM 8(1), 40 (1965). http://doi.acm.org/10.1145/363707.363723

    Article  Google Scholar 

  6. Larget, B., Simon, D.L.: Markov chain monte carlo algorithms for the bayesian analysis of phylogenetic trees. Mol. Biol. Evol. 16, 750–759 (1999)

    Article  Google Scholar 

  7. Li, S., Pearl, D.K., Doss, H.: Phylogenetic tree construction using markov chain monte carlo. J. Am. Statist. Assoc. 95, 493–508 (2000)

    Article  Google Scholar 

  8. Mau, B., Newton, M.A.: Phylogenetic inference for binary data on dendrograms using markov chain monte carlo. J. Comp. Graph. Stat. 6, 122–131 (1997)

    Google Scholar 

  9. NVIDIA: CUDA C Programming Guide (2013)

    Google Scholar 

  10. Pang, S., Stones, R.J., Ren, M.M., Liu, X.G., Wang, G., Xia, H., Wu, H.Y., Liu, Y., Xie, Q.: GPU MrBayes v3.1: GPU MrBayes on graphics processing units for protein sequence data. Mol. Biol. Evol. 32(9), 2496–2497 (2015)

    Article  Google Scholar 

  11. Pratas, F., Trancoso, P., Stamatakis, A., Sousa, L.: Fine-grain parallelism using multi-core, Cell/BE, and GPU systems: accelerating the phylogenetic likelihood function. In: 42nd International Conference on Parallel Processing, pp. 9–17 (2009)

    Google Scholar 

  12. Rannala, B., Yang, Z.: Probability distribution of molecular evolutionary trees: a new method of phylogenetic inference. J. Mol. Evol. 43, 304–311 (1996)

    Article  Google Scholar 

  13. Saitou, N., Nei, M.: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4, 406–425 (1987)

    Google Scholar 

  14. Schmidt, H., Strimmer, K., Vingron, M., Haeseler, A.: Tree-puzzle: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18, 502–504 (2002)

    Article  Google Scholar 

  15. Thuiller, W., Lavergne, S., Roquet, C., Boulangeat, I., Lafourcade, B., Araujo, M.B.: Parallel algorithms for bayesian phylogenetic inference. J. Parallel Distrib. Comput. 63, 707–718 (2003)

    Article  Google Scholar 

  16. Xie, Q., Bu, W., Zheng, L.: The bayesian phylogenetic analysis of the 18s RNA sequences from the main lineages of trichophora (insecta: Heteroptera:pentatomomorpha). Mol. Biol. Evol. 34, 448–451 (2005)

    Google Scholar 

  17. Yang, Z.: Phylogenetic analysis using parsimony and likelihood methods. J. Mol. Evol. 42(2), 294–307 (1996)

    Article  Google Scholar 

  18. Zhou, J., Liu, X.G., Stones, D.S., Xie, Q., Wang, G.: MrBayes on a graphics processing unit. Bioinformatics 27, 1255–1261 (2011)

    Article  Google Scholar 

Download references

Acknowledgements

A biology-focused version of this paper has been published [10]. This work is partially supported by NSF of China (grant numbers: 61373018, 11301288), Program for New Century Excellent Talents in University (grant number: NCET130301) and the Fundamental Research Funds for the Central Universities (grant number: 65141021). Stones was supported by her NSF China Research Fellowship for International Young Scientists (grant number: 11450110409). We would also like to thank Hongju Xia, Jianfu Zhou, Jie Bao and Prof. Qiang Xie for their valuable input.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ming-ming Ren .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Pang, S., Stones, R.J., Ren, Mm., Wang, G., Liu, X. (2015). MrBayes for Phylogenetic Inference Using Protein Data on a GPU Cluster. In: Wang, G., Zomaya, A., Martinez, G., Li, K. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2015. Lecture Notes in Computer Science(), vol 9530. Springer, Cham. https://doi.org/10.1007/978-3-319-27137-8_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27137-8_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27136-1

  • Online ISBN: 978-3-319-27137-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics