Abstract
The theory of evolution states that the diversity of species can be explained by descent with modification. Therefore, all living beings are related through a common ancestor. This evolutionary process must have left traces in our molecular composition. In this work, we present a randomization procedure in order to determine if a group of five species of the primate family, namely, macaque, guereza, orangutan, chimpanzee, and human, has retained these traces in its molecules. First, we present the randomization methodology through two toy examples, which allow to understand its logic. We then carry out a DNA data analysis to assess if the group of primates contains phylogenetic information which links them in a joint evolutionary history. This is carried out by monitoring a Bayesian measure, called marginal likelihood, which we estimate by using nested sampling. We found that it would be unusual to get the relationship observed in the data among these primate species if they had not shared a common ancestor. The results are in total agreement with the theory of evolution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Archie, J.W.: A randomization test for phylogenetic information in systematic data. Syst. Zool. 38, 239–252 (1989)
Baum, D.A., Smith, S.D.: Tree Thinking: An Introduction to Phylogenetic Biology. Roberts and Company Publishers, Greenwood Village (2012)
Darwin, C.: On the Origin of Species by Means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life. J. Murray, London (1859)
Maturana Russel, P., Brewer, B.J., Klaere, S., Bouckaert, R.: Model selection and parameter inference in phylogenetics using nested sampling. arXiv:1703.05471v3
Penny, D., Foulds, L.R., Hendy, M.D.: Testing the theory of evolution by comparing phylogenetic trees constructed from five different protein sequences. Nature 297, 197–200 (1982)
R Core Team.: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2015)
Roos, C., Zinner, D., Kubatko, L.S., Schwarz, C., Yang, M., Meyer, D., Nash, S.D., Xing, J., Batzer, M.A., Brameier, M., Leendertz, F.H., Ziegler, T., Perwitasari-Farajallah, D., Nadler, T., Walter, L., Osterholz, M.: Nuclear versus mitochondrial DNA: evidence for hybridization in colobine monkeys. BMC Evol. Biol. 11, 77 (2011)
Skilling, J.: Nested sampling for general Bayesian computation. Bayesian Anal. 1, 833–860 (2006)
Yang, Z.: Molecular Evolution: A Statistical Approach. Oxford University Press, Oxford (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendices
Appendix 1
We analyze the dataset assuming a \(\mathrm{GTR}+\varGamma _4\) model and consider the following prior distributions on the parameters involved in the analysis:
-
Branch lengths: \(t_i|\mu \sim \) Exp\((1/\mu )\), for \(i =1,\ldots ,8,\) with \(\mu \sim \) Inverse-Gamma(3,0.2).
-
Relative rates: \(q_i|\phi \sim \) Exp\((\phi )\), for \(i =1,\ldots ,5,\) with \(\phi \sim \) Exp(1).
-
Base frequencies: \(\pi \sim \) Dirichlet(1,1,1,1).
-
Gamma shape parameter: \(\lambda \sim \) Gamma(0.5,1).
For more information about the parameters involved in the phylogenetic analysis, see [9].
Appendix 2
Nested sampling [8] is a Bayesian algorithm to estimate mainly the marginal likelihood. It requires a tunning parameter called active points. The precision of the estimate depends on the number of active points. The higher it is, the more accurate the estimate and the higher the computational cost are.
To estimate the observed marginal likelihood, we use 100 active points. This yields a standard deviation of 0.73 of the log-marginal likelihood estimate. For the 1000 randomized datasets, we use five active points in order to get a quick picture of their log-marginal likelihood distribution.
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Maturana Russel, P. (2018). Bayesian Support for Evolution: Detecting Phylogenetic Signal in a Subset of the Primate Family. In: Polpo, A., Stern, J., Louzada, F., Izbicki, R., Takada, H. (eds) Bayesian Inference and Maximum Entropy Methods in Science and Engineering. maxent 2017. Springer Proceedings in Mathematics & Statistics, vol 239. Springer, Cham. https://doi.org/10.1007/978-3-319-91143-4_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-91143-4_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91142-7
Online ISBN: 978-3-319-91143-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)