Synopsis
Understanding protein structure and function requires understanding of the modular nature of proteins and their native folds. Most proteins are made up of one to several sequence segments or domains that share a common core fold and often function and in multidomain proteins are connected by typically unstructured linker sequences. Invention, duplication, sharing, and remodeling of domains have been constant processes throughout proteome evolution. Domains are classified by fold, conserved sequence, and conserved function into fold families and fold superfamilies. Surprisingly, the sum of domain fold space is highly limited and appears to be fully represented by as few as 1,200 folds, about 2,000 fold superfamilies, and roughly 4,000 fold families. Examination of fold family invention, loss, and sharing has revealed much about the history and associations of the three superkingdoms of life. Fold family diversification begins early in evolutionary history with rapid invention...
References
Caetano-Anolles G, Wang M, Caetano-Anolles D, Mittenthal JE (2009) The origin, evolution and structure of the protein world. Biochem J 417:621–637
CATH. http://www.cathdb.info/. Accessed 30 Apr 2014
InterPro. https://www.ebi.ac.uk/interpro/. Accessed 30 Apr 2014
Kim KM, Caetano-Anolles G (2012) The evolutionary history of protein fold families and proteomes confirms that the archaeal ancestor is more ancient than the ancestors of other superkingdoms. BMC Evol Biol. doi:10.1186/1471-2148-12-13
Kurland CG, Canbäck B, Berg OG (2007) The origins of modern proteomes. Biochimie 89:1454–1463
Marsden RL, Orengo CA (2008) The classification of protein domains. Methods Mol Biol 453:123–146
Marsden RL, Lee D, Maibaum M, Yeats C, Oreno CA (2006) Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space. Nucleic Acids Res 34:1066–1080
Nasir A, Kim KM, Caetano-Anolles G (2014) Global patterns of protein domain gain and loss in superkingdoms. PLoS Comput Biol 10:e1003452
Pfam. http://pfam.sanger.ac.uk/. Accessed 30 Apr 2014
Reeves GA, Dallman TJ, Redfern OC, Akpor A, Orengo CA (2006) Structural diversity of domain superfamilies in the CATH database. J Mol Biol 360:725–741
SCOP. http://scop.mrc-lmb.cam.ac.uk/scop/. Accessed 30 Apr 2014
SCOP2. http://scop2.mrc-lmb.cam.ac.uk/. Accessed 30 Apr 2014
SMART. http://smart.embl-heidelberg.de/. Accessed 30 Apr 2014
Wang M, Yafremava LS, Caetano-Anolles D, Mittenthal JE, Caetano-Anolles G (2007) Reductive evolution of architectural repertoires in proteomes and the birth of the tripartite world. Genome Res 17:1572–1585
Wang M, Kurland CG, Caetano-Anolles G (2011a) Reductive evolution of proteomes and protein structures. Proc Natl Acad Sci U S A 108:11954–11958
Wang M, Jiang Y, Kim KM, Qu G, Ji H, Mittenthal JE, Zhang H, Caetano-Anolles G (2011b) A universal molecular clock of protein folds and its power in tracing the early history of aerobic metabolism and planet oxygenation. Mol Biol Evol 28:567–582
Zhang Y, Hubner IA, Arakaki AK, Shakhnovich E, Skolnick J (2006) On the origin and highly likely completeness of single-domain protein structures. Proc Natl Acad Sci U S A 103:2605–2610
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer Science+Business Media New York
About this entry
Cite this entry
Vandergon, T.L. (2014). Protein Domain Structure Evolution. In: Bell, E. (eds) Molecular Life Sciences. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6436-5_19-2
Download citation
DOI: https://doi.org/10.1007/978-1-4614-6436-5_19-2
Received:
Accepted:
Published:
Publisher Name: Springer, New York, NY
Online ISBN: 978-1-4614-6436-5
eBook Packages: Springer Reference Biomedicine and Life SciencesReference Module Biomedical and Life Sciences