Pathway-Based Functional Analysis of Metagenomes

  • Sivan Bercovici
  • Itai Sharon
  • Ron Y. Pinter
  • Tomer Shlomi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6044)


Metagenomic data enables the study of microbes and viruses through their DNA as retrieved directly from the environment in which they live. Functional analysis of metagenomes explores the abundance of gene families, pathways, and systems, rather than their taxonomy. Through such analysis, researchers are able to identify the functional capabilities most important to organisms in the examined environment. Recently, a statistical framework for the functional analysis of metagenomes was described that focuses on gene families. Here we describe two pathway-level computational models for functional analysis that take into account important yet unaddressed issues such as pathway size, gene length, and overlap in gene content among pathways. We test our models on carefully designed simulated data and propose novel approaches for performance evaluation. Our models significantly improve over the current approach with respect to pathway ranking and the computation of the relative abundance of pathways in environments.


Metagenomics, functional analysis, pathways, Markov Chain Monte Carlo (MCMC)





Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Sivan Bercovici (1)
  • Itai Sharon (1)
  • Ron Y. Pinter (1)
  • Tomer Shlomi (1)
  1. Department of Computer Science, Technion, Haifa, Israel
