Abstract
A combined genomics and in situ proteomics approach can be used to determine and classify the relatedness of organisms. The common set of proteins shared within a group of genomes is encoded by the “core” set of genes, which is increasingly recognized as a metric for parsing viral and bacterial species. These can be described by the concept of a “pan-genome”, which consists of this “core” set and a “dispensable” set, i.e., genes found in one or more but not all organisms in the grouping. “CoreGenesUniqueGenes” (CGUG) is a web-based tool that determines this core set of proteins in a set of genomes as well as parses the dispensable set of unique proteins in a pair of viral or small bacterial genomes. This proteome-based methodology is validated using bacteriophages, aiding the reevaluation of current classifications of bacteriophages. The utility of CGUG in the analysis of small bacterial genomes and the annotation of hypothetical proteins is also presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zafar N, Mazumder R, Seto D (2002) CoreGenes: a computational tool for identifying and cataloging “core” genes in a set of small genomes. BMC bioinformatics 3, 12
Fauquet CM, Fargette D (2005) International Committee on Taxonomy of Viruses and the 3,142 unassigned species. Virology journal 2, 64
Nelson D (2004) Phage taxonomy: we agree to disagree. Journal of bacteriology 186, 7029–7031
Rohwer F, Edwards R (2002) The Phage Proteomic Tree: a genome-based taxonomy for phage. Journal of bacteriology 184, 4529–4535
Koonin EV (2003) Comparative genomics, minimal gene-sets and the last universal common ancestor. Nature reviews 1, 127–136
Lerat E, Daubin V, Moran NA (2003) From gene trees to organismal phylogeny in prokaryotes: the case of the gamma-Proteobacteria. PLoS biology 1, E19
Lavigne R, Seto D, Mahadevan P, Ackermann H-W, Kropinski AM (2008) Unifying classical and molecular taxonomic classification: analysis of the Podoviridae using BLASTP-based tools. Research in microbiology 159, 406–414
Mazumder R, Kolaskar A, Seto D (2001) GeneOrder: comparing the order of genes in small genomes. Bioinformatics (Oxford, England) 17, 162–166
Hendrix RW, Smith MC, Burns RN, Ford ME, Hatfull GF (1999) Evolutionary relationships among diverse bacteriophages and prophages: all the world’s a phage. Proceedings of the National Academy of Sciences of the United States of America 96, 2192–2197
Lavigne R, Darius P, Summer EJ, Seto D, Mahadevan P, Nilsson AS, Ackermann HW, Kropinski AM (2009) Classification of Myoviridae bacteriophages using protein sequence similarity. BMC microbiology 9, 224
Yoder-Himes DR, Chain PS, Zhu Y, Wurtzel O, Rubin EM, Tiedje JM, Sorek R (2009) Mapping the Burkholderia cenocepacia niche response via high-throughput sequencing. Proceedings of the National Academy of Sciences of the United States of America 106, 3976–3981
Cai CZ, Han LY, Ji ZL, Chen X, Chen YZ (2003) SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence. Nucleic acids research 31, 3692–3697
Sadowsky MJ, Tong Z, de Souza M, Wackett LP (1998) AtzC is a new member of the amidohydrolase protein superfamily and is homologous to other atrazine-metabolizing enzymes. Journal of bacteriology 180, 152–158
Porter DJ, Austin EA (1993) Cytosine deaminase. The roles of divalent cations in catalysis. Journal of biological chemistry 268, 24005–24011
Kelley LA, Sternberg MJE (2009) Protein structure prediction on the web: a case study using the Phyre server. Nature protocols 4, 363–371
Acknowledgments
We thank Drs. Andrew Kropinski and Rob Lavigne for their very helpful suggestions of features to add to CoreGenes. We thank Jason Seto for editorial comments and Chris Ryan for maintaining the server on which the software is hosted. This paper is based on Mahadevan and Seto (BIOCOMP ’09).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this paper
Cite this paper
Mahadevan, P., Seto, D. (2010). Taxonomic Parsing of Bacteriophages Using Core Genes and In Silico Proteome-Based CGUG and Applications to Small Bacterial Genomes. In: Arabnia, H. (eds) Advances in Computational Biology. Advances in Experimental Medicine and Biology, vol 680. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5913-3_43
Download citation
DOI: https://doi.org/10.1007/978-1-4419-5913-3_43
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-5912-6
Online ISBN: 978-1-4419-5913-3
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)