Abstract
The tree of life is the classical representation of the evolutionary relationships between existent species. A tree is appropriate to display the divergence of species through mutation, i.e., by vertical descent. However, lateral gene transfer (LGT) is excluded from such representations. When LGT contribution to genome evolution cannot be neglected (e.g., for prokaryotes and mobile genetic elements), the tree becomes misleading. Networks appear as an intuitive way to represent both vertical and horizontal relationships, while overlapping groups within such graphs are more suitable for their classification. Here, we describe a method to represent both vertical and horizontal relationships. We start with a set of genomes whose coded proteins have been grouped into families based on sequence similarity. Next, all pairs of genomes are compared, counting the number of proteins classified into the same family. From this comparison, we derive a weighted graph where genomes with a significant number of similar proteins are linked. Finally, we apply a two-step clustering of this graph to produce a classification where nodes can be assigned to multiple clusters. The procedure can be performed using the Network Analysis Tools (NeAT) website.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Darwin C. (1859) On the Origin of Species by Means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life. John Murray, London.
Doolittle WF. (1999) Phylogenetic classification and the universal tree. Science, 284:2124–2129.
Kunin V, Goldovsky L, Darzentas N, Ouzounis CA. (2005) The net of life: reconstructing the microbial phylogenetic network. Genome Res, 15:954–959.
Lima-Mendez G, Van Helden J, Toussaint A, Leplae R. (2008) Reticulate representation of evolutionary and functional relationships between phage genomes. Mol Biol Evol, 25:762–777.
Brohee S, Faust K, Lima-Mendez G, Sand O, Janky R, Vanderstocken G, Deville Y, van Helden J. (2008) NeAT: a toolbox for the analysis of biological networks, clusters, classes and pathways. Nucleic Acids Res, 36:W444–W451.
van Dongen S. (2000) Graph Clustering by Flow Simulation. In Centre for Mathematics and Computer Science. Utrecht, Amsterdam, 173.
Hendrix RW, Smith MC, Burns RN, Ford ME, Hatfull GF. (1999) Evolutionary relationships among diverse bacteriophages and prophages: all the world’s a phage. Proc Natl Acad Sci U S A, 96:2192–2197.
Rohwer F, Edwards R. (2002) The Phage Proteomic Tree: a genome-based taxonomy for phage. J Bacteriol, 184:4529–4535.
Leplae R, Lima-Mendez G, Toussaint A. ACLAME: a CLAssification of Mobile genetic Elements, update 2010. Nucleic Acids Res, 38:D57–D61.
Enright AJ, Van Dongen S, Ouzounis CA. (2002) An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res, 30:1575–1584.
Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, Doerks T, Julien P, Roth A, Simonovic M, Bork P, von Mering C. (2009) STRING 8 – a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res, 37:D412–D416.
Allison GE, Angeles D, Tran-Dinh N, Verma NK. (2002) Complete genomic sequence of SfV, a serotype-converting temperate bacteriophage of Shigella flexneri. J Bacteriol, 184:1974–1987.
Recktenwald J, Schmidt H. (2002) The nucleotide sequence of Shiga toxin (Stx) 2e-encoding phage phiP27 is not related to other Stx phage genomes, but the modular genetic structure is conserved. Infect Immun, 70:1896–1908.
Mmolawa PT, Schmieger H, Heuzenroeder MW. (2003) Bacteriophage ST64B, a genetic mosaic of genes from diverse sources isolated from Salmonella enterica serovar typhimurium DT 64. J Bacteriol, 185:6481–6485.
Dagan T, Artzy-Randrup Y, Martin W. (2008) Modular networks and cumulative impact of lateral transfer in prokaryote genome evolution. Proc Natl Acad Sci USA, 105(29), 10039–10044.
Acknowledgments
I wish to thank Raphael Leplae, Sylvain Brohée and Jacques van Helden for the maintenance of the ACLAME database and NeAT services, respectively, as well as Ariane Toussaint and Denis Thieffry for critical reading of the manuscript and testing the protocol.
Work on the ACLAME project in the BiGRe laboratory was financed by the European Space Agency (ESA-PRODEX) and the Belgian Science Policy (Belspo) through the EXANAM project (PRODEX agreements No. C90358), the Fonds de la Recherche Scientifique Médicale (FRSM), the Belgian Program on Interuniversity Attraction Poles, initiated by the Belgian Federal Science Policy Office, project P6/25 (BioMaGNet), and the BioSapiens Network of excellence funded under the sixth Framework program of the European Communities (LSHG-CT-2003-503265).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Lima-Mendez, G. (2012). Reticulate Classification of Mosaic Microbial Genomes Using NeAT Website. In: van Helden, J., Toussaint, A., Thieffry, D. (eds) Bacterial Molecular Networks. Methods in Molecular Biology, vol 804. Springer, New York, NY. https://doi.org/10.1007/978-1-61779-361-5_5
Download citation
DOI: https://doi.org/10.1007/978-1-61779-361-5_5
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-61779-360-8
Online ISBN: 978-1-61779-361-5
eBook Packages: Springer Protocols