Abstract
Proteins are subject to evolutionary forces that shape their three-dimensional structure to meet specific functional demands. The knowledge of the structure of a protein is therefore instrumental to gain information about the molecular basis of its function. However, experimental structure determination is inherently time consuming and expensive, making it impossible to follow the explosion of sequence data deriving from genome-scale projects. As a consequence, computational structural modeling techniques have received much attention and established themselves as a valuable complement to experimental structural biology efforts. Among these, comparative modeling remains the method of choice to model the three-dimensional structure of a protein when homology to a protein of known structure can be detected.
The general strategy consists of using experimentally determined structures of proteins as templates for the generation of three-dimensional models of related family members (targets) of which the structure is unknown. This chapter provides a description of the individual steps needed to obtain a comparative model using SWISS-MODEL, one of the most widely used automated servers for protein structure homology modeling.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Guex N, Peitsch MC, Schwede T (2009) Automated comparative protein structure modeling with SWISS-MODEL and Swiss-PdbViewer: a historical perspective. Electrophoresis 30 Suppl 1:S162–S173
Sali A, Blundell TL (1993) Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234:779–815
Chothia C, Lesk AM (1986) The relation between the divergence of sequence and structure in proteins. EMBO J 5:823–826
Arnold K, Bordoli L, Kopp J et al (2006) The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics 22:195–201
Biasini M, Bienert S, Waterhouse A et al (2014) SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res 42:W252–W258
Kiefer F, Arnold K, Kunzli M et al (2009) The SWISS-MODEL repository and associated resources. Nucleic Acids Res 37:D387–D392
Waterhouse A, Bertoni M, Bienert S et al (2018) SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Research Res 46(W1):W296–W303
Kryshtafovych A, Venclovas C, Fidelis K et al (2005) Progress over the first decade of CASP experiments. Proteins 61(Suppl 7):225–236
Berman H, Henrick K, Nakamura H et al (2007) The worldwide protein data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res 35:D301–D303
Altschul SF, Madden TL, Schaffer AA et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Remmert M, Biegert A, Hauser A et al (2011) HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods 9:173–175
Jones DT (1999) Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol 292:195–202
Sillitoe I, Cuff AL, Dessailly BH et al (2013) New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures. Nucleic Acids Res 41:D490–D498
Aloy P, Ceulemans H, Stark A et al (2003) The relationship between sequence and interaction divergence in proteins. J Mol Biol 332:989–998
Bertoni M, Kiefer F, Biasini M et al (2017) Modeling protein quaternary structure of homo- and hetero-oligomers beyond binary interactions by homology. Sci Rep 7:10480
Marcatili P, Olimpieri PP, Chailyan A et al (2014) Antibody modeling using the prediction of immunoglobulin structure (PIGS) web server [corrected]. Nat Protoc 9:2771–2783
Lepore R, Olimpieri PP, Messih MA et al (2017) PIGSPro: prediction of immunoGlobulin structures v2. Nucleic Acids Res 45:W17
Biasini M, Schmidt T, Bienert S et al (2013) OpenStructure: an integrated software framework for computational structural biology. Acta Crystallogr D Biol Crystallogr 69:701–709
Fiser A (2010) Template-based protein structure modeling. Methods Mol Biol 673:73–94
Choi Y, Deane CM (2010) FREAD revisited: accurate loop structure prediction using a database search algorithm. Proteins 78:1431–1440
Liang S, Zhang C, Zhou Y (2014) LEAP: highly accurate prediction of protein loop conformations by integrating coarse-grained sampling and optimized energy scores with all-atom refinement of backbone and side chains. J Comput Chem 35:335–341
Messih MA, Lepore R, Tramontano A (2015) LoopIng: a template-based tool for predicting the structure of protein loops. Bioinformatics 31:3767–3772
Canutescu AA, Dunbrack RL Jr (2003) Cyclic coordinate descent: a robotics algorithm for protein loop closure. Protein science: a publication of the protein. Society 12:963–972
Sippl MJ (1990) Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. J Mol Biol 213:859–883
Shapovalov MV, Dunbrack RL Jr (2011) A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. Structure 19:844–858
Krivov GG, Shapovalov MV, Dunbrack RL Jr (2009) Improved prediction of protein side-chain conformations with SCWRL4. Proteins 77:778–795
Xu J (2005) Rapid protein side-chain packing via tree decomposition. In: Miyano S, Mesirov J, Kasif S, Istrail S, Pevzner PA, Waterman M (eds) Research in computational molecular biology: 9th Annual International Conference, RECOMB 2005, Cambridge, MA, USA, May 14–18, 2005. Proceedings. Springer Berlin, Heidelberg, pp 423–439
Mackerell AD Jr, Feig M, Brooks CL 3rd (2004) Extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. J Comput Chem 25:1400–1415
Eastman P, Swails J, Chodera JD et al (2017) OpenMM 7: rapid development of high performance algorithms for molecular dynamics. PLoS Comput Biol 13:e1005659
Baker D, Sali A (2001) Protein structure prediction and structural genomics. Science 294:93–96
Schwede T, Sali A, Honig B et al (2009) Outcome of a workshop on applications of protein models in biomedical research. Structure 17:151–159
Read RJ, Adams PD, Arendall WB 3rd et al (2011) A new generation of crystallographic validation tools for the protein data bank. Structure 19:1395–1412
Benkert P, Biasini M, Schwede T (2011) Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics 27:343–350
Benkert P, Kunzli M, Schwede T (2009) QMEAN server for protein model quality estimation. Nucleic Acids Res 37:W510–W514
Haas J, Roth S, Arnold K et al (2013) The protein model portal--a comprehensive resource for protein structure and model information. Database 2013:bat031
Teh AH, Kanamasa S, Kajiwara S et al (2008) Structure of cu/Zn superoxide dismutase from the heavy-metal-tolerant yeast Cryptococcus liquefaciens strain N6. Biochem Biophys Res Commun 374:475–478
Benkert P, Tosatto SC, Schomburg D (2008) QMEAN: a comprehensive scoring function for model quality assessment. Proteins 71:261–277
Chothia C, Lesk AM (1987) Canonical structures for the hypervariable regions of immunoglobulins. J Mol Biol 196:901–917
Morea V, Tramontano A, Rustici M et al (1998) Conformations of the third hypervariable region in the VH domain of immunoglobulins. J Mol Biol 275:269–294
Tramontano A, Chothia C, Lesk AM (1990) Framework residue 71 is a major determinant of the position and conformation of the second hypervariable region in the VH domains of immunoglobulins. J Mol Biol 215:175–182
Messih MA, Lepore R, Marcatili P et al (2014) Improving the accuracy of the structure prediction of the third hypervariable loop of the heavy chains of antibodies. Bioinformatics 30:2733–2740
Almagro JC, Teplyakov A, Luo J et al (2014) Second antibody modeling assessment (AMA-II). Proteins 82:1553–1562
Moult J (2005) A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction. Curr Opin Struct Biol 15:285–289
Tai CH, Bai H, Taylor TJ et al (2014) Assessment of template-free modeling in CASP10 and ROLL. Proteins 82(Suppl 2):57–83
Meier A, Soding J (2015) Automatic prediction of protein 3D structures by probabilistic multi-template homology modeling. PLoS Comput Biol 11:e1004343
Larsson P, Wallner B, Lindahl E et al (2008) Using multiple templates to improve quality of homology models in automated homology modeling. Protein Sci 17:990–1002
Cheng J (2008) A multi-template combination algorithm for protein comparative modeling. BMC Struct Biol 8:18
Webb B, Sali A (2014) Comparative protein structure modeling using MODELLER. Curr Protoc Bioinformatics 47:5.6.1–5.6.32
Grosdidier A, Zoete V, Michielin O (2011) Fast docking using the CHARMM force field with EADock DSS. J Comput Chem 32:2149–2159
Grosdidier A, Zoete V, Michielin O (2011) SwissDock, a protein-small molecule docking web service based on EADock DSS. Nucleic Acids Res 39:W270–W277
Lensink MF, Velankar S, Wodak SJ (2017) Modeling protein-protein and protein-peptide complexes: CAPRI 6th edition. Proteins 85:359–377
Esquivel-Rodriguez J, Filos-Gonzalez V, Li B et al (2014) Pairwise and multimeric protein-protein docking using the LZerD program suite. Methods Mol Biol 1137:209–234
Pierce B, Tong W, Weng Z (2005) M-ZDOCK: a grid-based approach for Cn symmetric multimer docking. Bioinformatics 21:1472–1478
De Vries SJ, Van Dijk M, Bonvin AM (2010) The HADDOCK web server for data-driven biomolecular docking. Nat Protoc 5:883–897
Leaver-Fay A, Tyka M, Lewis SM et al (2011) ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol 487:545–574
Russel D, Lasker K, Webb B et al (2012) Putting the pieces together: integrative modeling platform software for structure determination of macromolecular assemblies. PLoS Biol 10:e1001244
Simons KT, Kooperberg C, Huang E et al (1997) Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J Mol Biol 268:209–225
Yang J, Yan R, Roy A et al (2015) The I-TASSER suite: protein structure and function prediction. Nat Methods 12:7–8
Maghrabi AHA, Mcguffin LJ (2017) ModFOLD6: an accurate web server for the global and local quality estimation of 3D protein models. Nucleic Acids Res 45(W1):W416–W421
Heo L, Feig M (2018) What makes it difficult to refine protein models further via molecular dynamics simulations? Proteins 86(Suppl 1):177–188
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Studer, G. et al. (2019). Modeling of Protein Tertiary and Quaternary Structures Based on Evolutionary Information. In: Sikosek, T. (eds) Computational Methods in Protein Evolution. Methods in Molecular Biology, vol 1851. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-8736-8_17
Download citation
DOI: https://doi.org/10.1007/978-1-4939-8736-8_17
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-8735-1
Online ISBN: 978-1-4939-8736-8
eBook Packages: Springer Protocols