Modeling of Protein Tertiary and Quaternary Structures Based on Evolutionary Information

Studer, Gabriel; Tauriello, Gerardo; Bienert, Stefan; Waterhouse, Andrew Mark; Bertoni, Martino; Bordoli, Lorenza; Schwede, Torsten; Lepore, Rosalba

doi:10.1007/978-1-4939-8736-8_17

Gabriel Studer³,
Gerardo Tauriello³,
Stefan Bienert³,
Andrew Mark Waterhouse³,
Martino Bertoni³,
Lorenza Bordoli³,
Torsten Schwede³ &
…
Rosalba Lepore³

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1851))

2910 Accesses
12 Citations

Abstract

Proteins are subject to evolutionary forces that shape their three-dimensional structure to meet specific functional demands. The knowledge of the structure of a protein is therefore instrumental to gain information about the molecular basis of its function. However, experimental structure determination is inherently time consuming and expensive, making it impossible to follow the explosion of sequence data deriving from genome-scale projects. As a consequence, computational structural modeling techniques have received much attention and established themselves as a valuable complement to experimental structural biology efforts. Among these, comparative modeling remains the method of choice to model the three-dimensional structure of a protein when homology to a protein of known structure can be detected.

The general strategy consists of using experimentally determined structures of proteins as templates for the generation of three-dimensional models of related family members (targets) of which the structure is unknown. This chapter provides a description of the individual steps needed to obtain a comparative model using SWISS-MODEL, one of the most widely used automated servers for protein structure homology modeling.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Guex N, Peitsch MC, Schwede T (2009) Automated comparative protein structure modeling with SWISS-MODEL and Swiss-PdbViewer: a historical perspective. Electrophoresis 30 Suppl 1:S162–S173
Article Google Scholar
Sali A, Blundell TL (1993) Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234:779–815
Article CAS Google Scholar
Chothia C, Lesk AM (1986) The relation between the divergence of sequence and structure in proteins. EMBO J 5:823–826
Article CAS Google Scholar
Arnold K, Bordoli L, Kopp J et al (2006) The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics 22:195–201
Article CAS Google Scholar
Biasini M, Bienert S, Waterhouse A et al (2014) SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res 42:W252–W258
Article CAS Google Scholar
Kiefer F, Arnold K, Kunzli M et al (2009) The SWISS-MODEL repository and associated resources. Nucleic Acids Res 37:D387–D392
Article CAS Google Scholar
Waterhouse A, Bertoni M, Bienert S et al (2018) SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Research Res 46(W1):W296–W303
Article Google Scholar
Kryshtafovych A, Venclovas C, Fidelis K et al (2005) Progress over the first decade of CASP experiments. Proteins 61(Suppl 7):225–236
Article CAS Google Scholar
Berman H, Henrick K, Nakamura H et al (2007) The worldwide protein data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res 35:D301–D303
Article CAS Google Scholar
Altschul SF, Madden TL, Schaffer AA et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Article CAS Google Scholar
Remmert M, Biegert A, Hauser A et al (2011) HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods 9:173–175
Article Google Scholar
Jones DT (1999) Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol 292:195–202
Article CAS Google Scholar
Sillitoe I, Cuff AL, Dessailly BH et al (2013) New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures. Nucleic Acids Res 41:D490–D498
Article CAS Google Scholar
Aloy P, Ceulemans H, Stark A et al (2003) The relationship between sequence and interaction divergence in proteins. J Mol Biol 332:989–998
Article CAS Google Scholar
Bertoni M, Kiefer F, Biasini M et al (2017) Modeling protein quaternary structure of homo- and hetero-oligomers beyond binary interactions by homology. Sci Rep 7:10480
Article Google Scholar
Marcatili P, Olimpieri PP, Chailyan A et al (2014) Antibody modeling using the prediction of immunoglobulin structure (PIGS) web server [corrected]. Nat Protoc 9:2771–2783
Article CAS Google Scholar
Lepore R, Olimpieri PP, Messih MA et al (2017) PIGSPro: prediction of immunoGlobulin structures v2. Nucleic Acids Res 45:W17
Article CAS Google Scholar
Biasini M, Schmidt T, Bienert S et al (2013) OpenStructure: an integrated software framework for computational structural biology. Acta Crystallogr D Biol Crystallogr 69:701–709
Article CAS Google Scholar
Fiser A (2010) Template-based protein structure modeling. Methods Mol Biol 673:73–94
Article CAS Google Scholar
Choi Y, Deane CM (2010) FREAD revisited: accurate loop structure prediction using a database search algorithm. Proteins 78:1431–1440
CAS PubMed Google Scholar
Liang S, Zhang C, Zhou Y (2014) LEAP: highly accurate prediction of protein loop conformations by integrating coarse-grained sampling and optimized energy scores with all-atom refinement of backbone and side chains. J Comput Chem 35:335–341
Article CAS Google Scholar
Messih MA, Lepore R, Tramontano A (2015) LoopIng: a template-based tool for predicting the structure of protein loops. Bioinformatics 31:3767–3772
PubMed PubMed Central Google Scholar
Canutescu AA, Dunbrack RL Jr (2003) Cyclic coordinate descent: a robotics algorithm for protein loop closure. Protein science: a publication of the protein. Society 12:963–972
CAS Google Scholar
Sippl MJ (1990) Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. J Mol Biol 213:859–883
Article CAS Google Scholar
Shapovalov MV, Dunbrack RL Jr (2011) A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. Structure 19:844–858
Article CAS Google Scholar
Krivov GG, Shapovalov MV, Dunbrack RL Jr (2009) Improved prediction of protein side-chain conformations with SCWRL4. Proteins 77:778–795
Article CAS Google Scholar
Xu J (2005) Rapid protein side-chain packing via tree decomposition. In: Miyano S, Mesirov J, Kasif S, Istrail S, Pevzner PA, Waterman M (eds) Research in computational molecular biology: 9th Annual International Conference, RECOMB 2005, Cambridge, MA, USA, May 14–18, 2005. Proceedings. Springer Berlin, Heidelberg, pp 423–439
Chapter Google Scholar
Mackerell AD Jr, Feig M, Brooks CL 3rd (2004) Extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. J Comput Chem 25:1400–1415
Article CAS Google Scholar
Eastman P, Swails J, Chodera JD et al (2017) OpenMM 7: rapid development of high performance algorithms for molecular dynamics. PLoS Comput Biol 13:e1005659
Article Google Scholar
Baker D, Sali A (2001) Protein structure prediction and structural genomics. Science 294:93–96
Article CAS Google Scholar
Schwede T, Sali A, Honig B et al (2009) Outcome of a workshop on applications of protein models in biomedical research. Structure 17:151–159
Article CAS Google Scholar
Read RJ, Adams PD, Arendall WB 3rd et al (2011) A new generation of crystallographic validation tools for the protein data bank. Structure 19:1395–1412
Article CAS Google Scholar
Benkert P, Biasini M, Schwede T (2011) Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics 27:343–350
Article CAS Google Scholar
Benkert P, Kunzli M, Schwede T (2009) QMEAN server for protein model quality estimation. Nucleic Acids Res 37:W510–W514
Article CAS Google Scholar
Haas J, Roth S, Arnold K et al (2013) The protein model portal--a comprehensive resource for protein structure and model information. Database 2013:bat031
Article Google Scholar
Teh AH, Kanamasa S, Kajiwara S et al (2008) Structure of cu/Zn superoxide dismutase from the heavy-metal-tolerant yeast Cryptococcus liquefaciens strain N6. Biochem Biophys Res Commun 374:475–478
Article CAS Google Scholar
Benkert P, Tosatto SC, Schomburg D (2008) QMEAN: a comprehensive scoring function for model quality assessment. Proteins 71:261–277
Article CAS Google Scholar
Chothia C, Lesk AM (1987) Canonical structures for the hypervariable regions of immunoglobulins. J Mol Biol 196:901–917
Article CAS Google Scholar
Morea V, Tramontano A, Rustici M et al (1998) Conformations of the third hypervariable region in the VH domain of immunoglobulins. J Mol Biol 275:269–294
Article CAS Google Scholar
Tramontano A, Chothia C, Lesk AM (1990) Framework residue 71 is a major determinant of the position and conformation of the second hypervariable region in the VH domains of immunoglobulins. J Mol Biol 215:175–182
Article CAS Google Scholar
Messih MA, Lepore R, Marcatili P et al (2014) Improving the accuracy of the structure prediction of the third hypervariable loop of the heavy chains of antibodies. Bioinformatics 30:2733–2740
Article CAS Google Scholar
Almagro JC, Teplyakov A, Luo J et al (2014) Second antibody modeling assessment (AMA-II). Proteins 82:1553–1562
Article CAS Google Scholar
Moult J (2005) A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction. Curr Opin Struct Biol 15:285–289
Article CAS Google Scholar
Tai CH, Bai H, Taylor TJ et al (2014) Assessment of template-free modeling in CASP10 and ROLL. Proteins 82(Suppl 2):57–83
Article CAS Google Scholar
Meier A, Soding J (2015) Automatic prediction of protein 3D structures by probabilistic multi-template homology modeling. PLoS Comput Biol 11:e1004343
Article Google Scholar
Larsson P, Wallner B, Lindahl E et al (2008) Using multiple templates to improve quality of homology models in automated homology modeling. Protein Sci 17:990–1002
Article CAS Google Scholar
Cheng J (2008) A multi-template combination algorithm for protein comparative modeling. BMC Struct Biol 8:18
Article Google Scholar
Webb B, Sali A (2014) Comparative protein structure modeling using MODELLER. Curr Protoc Bioinformatics 47:5.6.1–5.6.32
Article Google Scholar
Grosdidier A, Zoete V, Michielin O (2011) Fast docking using the CHARMM force field with EADock DSS. J Comput Chem 32:2149–2159
Article CAS Google Scholar
Grosdidier A, Zoete V, Michielin O (2011) SwissDock, a protein-small molecule docking web service based on EADock DSS. Nucleic Acids Res 39:W270–W277
Article CAS Google Scholar
Lensink MF, Velankar S, Wodak SJ (2017) Modeling protein-protein and protein-peptide complexes: CAPRI 6th edition. Proteins 85:359–377
Article CAS Google Scholar
Esquivel-Rodriguez J, Filos-Gonzalez V, Li B et al (2014) Pairwise and multimeric protein-protein docking using the LZerD program suite. Methods Mol Biol 1137:209–234
Article CAS Google Scholar
Pierce B, Tong W, Weng Z (2005) M-ZDOCK: a grid-based approach for Cn symmetric multimer docking. Bioinformatics 21:1472–1478
Article CAS Google Scholar
De Vries SJ, Van Dijk M, Bonvin AM (2010) The HADDOCK web server for data-driven biomolecular docking. Nat Protoc 5:883–897
Article Google Scholar
Leaver-Fay A, Tyka M, Lewis SM et al (2011) ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol 487:545–574
Article CAS Google Scholar
Russel D, Lasker K, Webb B et al (2012) Putting the pieces together: integrative modeling platform software for structure determination of macromolecular assemblies. PLoS Biol 10:e1001244
Article CAS Google Scholar
Simons KT, Kooperberg C, Huang E et al (1997) Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J Mol Biol 268:209–225
Article CAS Google Scholar
Yang J, Yan R, Roy A et al (2015) The I-TASSER suite: protein structure and function prediction. Nat Methods 12:7–8
Article CAS Google Scholar
Maghrabi AHA, Mcguffin LJ (2017) ModFOLD6: an accurate web server for the global and local quality estimation of 3D protein models. Nucleic Acids Res 45(W1):W416–W421
Article CAS Google Scholar
Heo L, Feig M (2018) What makes it difficult to refine protein models further via molecular dynamics simulations? Proteins 86(Suppl 1):177–188
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Biozentrum, University of Basel and SIB Swiss Institute of Bioinformatics, Basel, Switzerland
Gabriel Studer, Gerardo Tauriello, Stefan Bienert, Andrew Mark Waterhouse, Martino Bertoni, Lorenza Bordoli, Torsten Schwede & Rosalba Lepore

Authors

Gabriel Studer
View author publications
You can also search for this author in PubMed Google Scholar
Gerardo Tauriello
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Bienert
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Mark Waterhouse
View author publications
You can also search for this author in PubMed Google Scholar
Martino Bertoni
View author publications
You can also search for this author in PubMed Google Scholar
Lorenza Bordoli
View author publications
You can also search for this author in PubMed Google Scholar
Torsten Schwede
View author publications
You can also search for this author in PubMed Google Scholar
Rosalba Lepore
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rosalba Lepore .

Editor information

Editors and Affiliations

GlaxoSmithKline, Cellzome – a GSK company Meyerhofstrasse 1, Heidelberg, Baden-Württemberg, Germany
Tobias Sikosek

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Studer, G. et al. (2019). Modeling of Protein Tertiary and Quaternary Structures Based on Evolutionary Information. In: Sikosek, T. (eds) Computational Methods in Protein Evolution. Methods in Molecular Biology, vol 1851. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-8736-8_17

Download citation

DOI: https://doi.org/10.1007/978-1-4939-8736-8_17
Published: 27 September 2018
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-8735-1
Online ISBN: 978-1-4939-8736-8
eBook Packages: Springer Protocols

Publish with us

Policies and ethics