Skip to main content

Bayesian Support for Evolution: Detecting Phylogenetic Signal in a Subset of the Primate Family

  • Conference paper
  • First Online:
Bayesian Inference and Maximum Entropy Methods in Science and Engineering (maxent 2017)

Abstract

The theory of evolution states that the diversity of species can be explained by descent with modification. Therefore, all living beings are related through a common ancestor. This evolutionary process must have left traces in our molecular composition. In this work, we present a randomization procedure in order to determine if a group of five species of the primate family, namely, macaque, guereza, orangutan, chimpanzee, and human, has retained these traces in its molecules. First, we present the randomization methodology through two toy examples, which allow to understand its logic. We then carry out a DNA data analysis to assess if the group of primates contains phylogenetic information which links them in a joint evolutionary history. This is carried out by monitoring a Bayesian measure, called marginal likelihood, which we estimate by using nested sampling. We found that it would be unusual to get the relationship observed in the data among these primate species if they had not shared a common ancestor. The results are in total agreement with the theory of evolution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Archie, J.W.: A randomization test for phylogenetic information in systematic data. Syst. Zool. 38, 239–252 (1989)

    Article  Google Scholar 

  2. Baum, D.A., Smith, S.D.: Tree Thinking: An Introduction to Phylogenetic Biology. Roberts and Company Publishers, Greenwood Village (2012)

    Google Scholar 

  3. Darwin, C.: On the Origin of Species by Means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life. J. Murray, London (1859)

    Google Scholar 

  4. Maturana Russel, P., Brewer, B.J., Klaere, S., Bouckaert, R.: Model selection and parameter inference in phylogenetics using nested sampling. arXiv:1703.05471v3

  5. Penny, D., Foulds, L.R., Hendy, M.D.: Testing the theory of evolution by comparing phylogenetic trees constructed from five different protein sequences. Nature 297, 197–200 (1982)

    Article  Google Scholar 

  6. R Core Team.: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2015)

    Google Scholar 

  7. Roos, C., Zinner, D., Kubatko, L.S., Schwarz, C., Yang, M., Meyer, D., Nash, S.D., Xing, J., Batzer, M.A., Brameier, M., Leendertz, F.H., Ziegler, T., Perwitasari-Farajallah, D., Nadler, T., Walter, L., Osterholz, M.: Nuclear versus mitochondrial DNA: evidence for hybridization in colobine monkeys. BMC Evol. Biol. 11, 77 (2011)

    Article  Google Scholar 

  8. Skilling, J.: Nested sampling for general Bayesian computation. Bayesian Anal. 1, 833–860 (2006)

    Article  MathSciNet  Google Scholar 

  9. Yang, Z.: Molecular Evolution: A Statistical Approach. Oxford University Press, Oxford (2014)

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Patricio Maturana Russel .

Editor information

Editors and Affiliations

Appendices

Appendix 1

We analyze the dataset assuming a \(\mathrm{GTR}+\varGamma _4\) model and consider the following prior distributions on the parameters involved in the analysis:

  • Branch lengths: \(t_i|\mu \sim \) Exp\((1/\mu )\), for \(i =1,\ldots ,8,\) with \(\mu \sim \) Inverse-Gamma(3,0.2).

  • Relative rates: \(q_i|\phi \sim \) Exp\((\phi )\), for \(i =1,\ldots ,5,\) with \(\phi \sim \) Exp(1).

  • Base frequencies: \(\pi \sim \) Dirichlet(1,1,1,1).

  • Gamma shape parameter: \(\lambda \sim \) Gamma(0.5,1).

For more information about the parameters involved in the phylogenetic analysis, see [9].

Appendix 2

Nested sampling [8] is a Bayesian algorithm to estimate mainly the marginal likelihood. It requires a tunning parameter called active points. The precision of the estimate depends on the number of active points. The higher it is, the more accurate the estimate and the higher the computational cost are.

To estimate the observed marginal likelihood, we use 100 active points. This yields a standard deviation of 0.73 of the log-marginal likelihood estimate. For the 1000 randomized datasets, we use five active points in order to get a quick picture of their log-marginal likelihood distribution.

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Maturana Russel, P. (2018). Bayesian Support for Evolution: Detecting Phylogenetic Signal in a Subset of the Primate Family. In: Polpo, A., Stern, J., Louzada, F., Izbicki, R., Takada, H. (eds) Bayesian Inference and Maximum Entropy Methods in Science and Engineering. maxent 2017. Springer Proceedings in Mathematics & Statistics, vol 239. Springer, Cham. https://doi.org/10.1007/978-3-319-91143-4_20

Download citation

Publish with us

Policies and ethics