Skip to main content

Protein Domain Structure Evolution

  • Living reference work entry
  • First Online:
Molecular Life Sciences

Synopsis

Understanding protein structure and function requires understanding of the modular nature of proteins and their native folds. Most proteins are made up of one to several sequence segments or domains that share a common core fold and often function and in multidomain proteins are connected by typically unstructured linker sequences. Invention, duplication, sharing, and remodeling of domains have been constant processes throughout proteome evolution. Domains are classified by fold, conserved sequence, and conserved function into fold families and fold superfamilies. Surprisingly, the sum of domain fold space is highly limited and appears to be fully represented by as few as 1,200 folds, about 2,000 fold superfamilies, and roughly 4,000 fold families. Examination of fold family invention, loss, and sharing has revealed much about the history and associations of the three superkingdoms of life. Fold family diversification begins early in evolutionary history with rapid invention...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  • Caetano-Anolles G, Wang M, Caetano-Anolles D, Mittenthal JE (2009) The origin, evolution and structure of the protein world. Biochem J 417:621–637

    Article  CAS  PubMed  Google Scholar 

  • CATH. http://www.cathdb.info/. Accessed 30 Apr 2014

  • InterPro. https://www.ebi.ac.uk/interpro/. Accessed 30 Apr 2014

  • Kim KM, Caetano-Anolles G (2012) The evolutionary history of protein fold families and proteomes confirms that the archaeal ancestor is more ancient than the ancestors of other superkingdoms. BMC Evol Biol. doi:10.1186/1471-2148-12-13

    Google Scholar 

  • Kurland CG, Canbäck B, Berg OG (2007) The origins of modern proteomes. Biochimie 89:1454–1463

    Article  CAS  PubMed  Google Scholar 

  • Marsden RL, Orengo CA (2008) The classification of protein domains. Methods Mol Biol 453:123–146

    Article  CAS  PubMed  Google Scholar 

  • Marsden RL, Lee D, Maibaum M, Yeats C, Oreno CA (2006) Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space. Nucleic Acids Res 34:1066–1080

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  • Nasir A, Kim KM, Caetano-Anolles G (2014) Global patterns of protein domain gain and loss in superkingdoms. PLoS Comput Biol 10:e1003452

    Article  PubMed Central  PubMed  Google Scholar 

  • Pfam. http://pfam.sanger.ac.uk/. Accessed 30 Apr 2014

  • Reeves GA, Dallman TJ, Redfern OC, Akpor A, Orengo CA (2006) Structural diversity of domain superfamilies in the CATH database. J Mol Biol 360:725–741

    Article  CAS  PubMed  Google Scholar 

  • SCOP. http://scop.mrc-lmb.cam.ac.uk/scop/. Accessed 30 Apr 2014

  • SCOP2. http://scop2.mrc-lmb.cam.ac.uk/. Accessed 30 Apr 2014

  • SMART. http://smart.embl-heidelberg.de/. Accessed 30 Apr 2014

  • Wang M, Yafremava LS, Caetano-Anolles D, Mittenthal JE, Caetano-Anolles G (2007) Reductive evolution of architectural repertoires in proteomes and the birth of the tripartite world. Genome Res 17:1572–1585

    Article  PubMed Central  PubMed  Google Scholar 

  • Wang M, Kurland CG, Caetano-Anolles G (2011a) Reductive evolution of proteomes and protein structures. Proc Natl Acad Sci U S A 108:11954–11958

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  • Wang M, Jiang Y, Kim KM, Qu G, Ji H, Mittenthal JE, Zhang H, Caetano-Anolles G (2011b) A universal molecular clock of protein folds and its power in tracing the early history of aerobic metabolism and planet oxygenation. Mol Biol Evol 28:567–582

    Article  CAS  PubMed  Google Scholar 

  • Zhang Y, Hubner IA, Arakaki AK, Shakhnovich E, Skolnick J (2006) On the origin and highly likely completeness of single-domain protein structures. Proc Natl Acad Sci U S A 103:2605–2610

    Article  CAS  PubMed Central  PubMed  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thomas L. Vandergon .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this entry

Cite this entry

Vandergon, T.L. (2014). Protein Domain Structure Evolution. In: Bell, E. (eds) Molecular Life Sciences. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6436-5_19-2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-6436-5_19-2

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer, New York, NY

  • Online ISBN: 978-1-4614-6436-5

  • eBook Packages: Springer Reference Biomedicine and Life SciencesReference Module Biomedical and Life Sciences

Publish with us

Policies and ethics