Abstract
The number of known three-dimensional protein sequences is orders of magnitude higher than the number of known protein structures. This is a result of an increase in large-scale genomic sequencing projects, the inability of proteins to crystallize or crystals to diffract well, or a simple lack of resources. An alternative is to use one of a variety of available homology modeling programs to produce a computational model of a protein. Protein models are produced using information from known protein structures found to be similar. Here, we compare the ability of a number of popular homology modeling programs to produce quality models from user-defined target–template sequence alignments over a range of circumstances including low sequence identity, variable sequence length, and when interfaced with a protein or small molecule. Programs evaluated include Prime, SWISS-MODEL, MOE, MODELLER, ROSETTA, Composer, ORCHESTRAR, and I-TASSER. Proteins to be modeled were chosen to test a range of sequence identities, sequence lengths, and protein motifs and all are of scientific importance. These include HIV-1 protease, kinases, dihydrofolate reductase, a viral capsid protein, and factor Xa among others. For the most part, the programs produce results that are similar. For example, all programs are able to produce reasonable models when sequence identities are >30% and all programs have difficulties producing complete models when sequence identities are lower. However, certain programs fare slightly better than others in certain situations and we attempt to provide insight on this topic.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Evers A and Klebe G (2004) Successful virtual screening for a submicromolar antagonist of the neurokinin-1 receptor base on a ligand-supported homology model. J Med Chem 47:5381–5392
Evers A and Klabunde T (2005) Structure-based drug discovery using GPCR homology modeling: Successful virtual screening for antagonists of the alpha1A androgenic receptor. J Med Chem 48:1088–1097
Rasmussen SG, Choi HJ, Rosenbaum DM, Kobilka TS, Thian FS, Edwards PC, Burghammer M, Ratnala VR, Sanishvili R, Fischetti RF, Schertler GF, Weis WI, and Kobilka BK (2007) Crystal structure of the human β2-adrenergic G-protein-coupled receptor. Nature 450:383–7
Cherezov V, Rosenbaum DM, Hanson MA, Rasmussen SG, Thian FS, Kobilka TS, Choi HJ, Kuhn P, Weis WI, Kobilka BK, and Stevens RC (2007) High-resolution crystal structure of an engineered human β2-adrenergic G protein-coupled receptor. Science 318:1258–65
Rosenbaum DM, Cherezov V, Hanson MA, Rasmussen SG, Thian FS, Kobilka TS, Choi HJ, Yao XJ, Weis WI, Stevens RC and Kobilka BK (2007) GPCR engineering yields high-resolution structural insights into β2-adrenergic receptor function. Science 318 (5854):1266–73
Wu CH, Apweiler R, Bairoch A, Natale DA et al (2006) The Universal Protein Resource (UniProt): An expanding universe of protein information. Nucl Acids Res 34:Database issue D187-D191
Schwede T, Kopp J, Guex N, and Peitsch MC (2003) SWISS-MODEL: An automated protein homology-modeling server. Nucl Acids Res 31:3381–3385
Sippl MJ and Weitckus S (1992) Detection of native-like models for amino acid sequences of unknown three-dimensional structure in a database of known protein conformations. Proteins 13:258–271
Abagyan RA, Totrov MM, and Kuznetsov DA (1994) ICM: a new method for protein modeling and design: applications to docking and structure prediction from the distorted native conformation. J Comp Chem 15:488–506
Misura KM, Chivian D, Rohl CA, Kim DE, Baker D (2006) Physically realistic homology models built with ROSETTA can be more accurate than their templates. PNAS 103(14):5361–6
Sali A and Blundell TL (1993) Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234:779–815
Montalvao RW, Smith RE, Lovell SC and Blundell TL (2005) CHORAL: A differential geometry approach to the prediction of the cores of protein structures. Bioinformatics 21:3719–3725
Smith RE, Lovell SC, Burke DF, Montalvao RW and Blundell TL (2007) Andante: reducing side-chain rotamer search space during comparative modeling using environment-specific substitution probabilities. Bioinformatics 23:1099–105
Deane CM and Blundell TL (2001) CODA: A combined algorithm for predicting the structurally variable regions of protein models. Protein Sci 10:599–612
Sali A and Blundell TL (1990) Definition of general topological equivalence in protein structures. A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming. J Mol Biol 212:403–28
Zhu ZY, Sali A and Blundell TL (1992) A variable gap penalty function and feature weights for protein 3-D structure comparisons. Protein Eng 5:43–51
Sutcliffe MJ, Haneef I, Carney D, Blundell TL (1987a) Knowledge-based modeling of homologous proteins, Part 1: Three-dimensional frameworks derived from the simultaneous superposition of multiple structures. Protein Eng 1:377–384
Sutcliffe MJ, Hayes FR, Blundell TL (1987b) Knowledge-based modeling of homologous proteins, Part 2: Rules for the conformations of substituted sidechains. Protein Eng. 1:385
Levitt M (1992) Accurate modeling of protein conformation by automatic segment matching. J Mol Biol 226:507–533
MOE. Chemical Computing Group, Montreal, Quebec, Canada.
Prime. Schrödinger, LLC, Portland, OR
Tramontano A, Cozzetto D, Giorgetti A, Raimondo D (2007) The assessment of methods for protein structure prediction. Methods Mol Biol 413:43–58
Nayeem A, Sitkoff D, Krystek S (2006) A comparative study of available software for high-accuracy homology modeling: from sequence alignments to structural models. Protein Sci 15:808–24
Wallner B, Elofsson A (2005) All are not equal: A benchmark of different homology modeling programs. Protein Sci 14:1315–1327
Dolan MA, Keil M, Baker DS (2008) Comparison of Composer and ORCHESTRAR. Proteins 72:1243–58
Jorgensen WL, Maxwell DS and Tirado-Rives J (1996) Development and testing of the OPLS all-atom force field on conformational energetics and properties of organic liquids. J Am Chem Soc 118:11225–11236
Kaminski GA, Friesner RA, Tirado-Rives J and Jorgensen WL (2001) Evaluation and reparametrization of the OPLS-AA force field for proteins via comparison with accurate quantum chemical calculations on peptides. J Phys Chem B 105:6474–6487
Gallicchio E, Zhang LY and Levy RM (2002) The SGB/NP hydration free energy model based on the surface generalized born solvent reaction field and novel nonpolar hydration free energy estimators. J Comp Chem 23:517–529
Jacobson MP, Pincus DL, Rapp CS, Day TJF, Honig B, Shaw DE, Friesner RA (2004) A hierarchical approach to all-atom protein loop prediction Proteins 55:351–367
Fechteler T, Dengler U, and Schomburg D (1995) Prediction of protein three-dimensional structures in insertion and deletion regions: A procedure for searching data bases of representative protein fragments using geometric scoring criteria. J Mol Biol 253:114–131
Peitsch MC (1996) ProMod and Swiss-Model: Internet-based tools for automated comparative protein modeling. Biochem Soc Trans 24(1):274–279
Van Gunsteren WF, Billeter SR, Eising AA, Hünenberger PH, Krüger P, Mark AE, Scott WRP, and Tironi IG (1996) Biomolecular Simulation: The GROMOS96 Manual and User Guide, pp. 1–1042. Vdf Hochschulverlag AG an der ETH Zürich, Zürich, Switzerland
Shen M-y, Sali A (2006) Statistical potential for assessment and prediction of protein structures. Protein Science 15:2507–2524
Eramian D, Shen M-y, Devos D, Melo F, Sali A and Marti-Renom MA (2006) A composite score for predicting errors in protein structure models. Protein Science 15:1653–1666
Roy A, Kucukural A, Zhang Y (2010) I-TASSER: a unified platform for automated protein structure and function prediction. Nature Protocols 5:725–738
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, and Bourne PE (2000) The Protein Data Bank. Nucl Acids Res 28:235–242
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W and Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl Acids Res 25:3389–3402
Shi J, Blundell TL, and Mizuguchi K (2001) FUGUE: Sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties. J Mol Biol 310:243–257
de Bakker PIW, Bateman A, Burke DF, Miguel RN, Mizuguchi K, Shi J, Shirai H, and Blundell TL (2001) HOMSTRAD: Adding sequence information to structure-based alignments of homologous protein families. Bioinformatics 17:748–749
Mizuguchi K, Deane C, Blundell T, and Overington J (1998) HOMSTRAD: A database of protein structure alignments for homologous families. Protein Sci 7:2469–2471
Sherman W, Day T, Jacobson MP, Friesner RA, Farid R (2006) Novel procedure for modeling ligand/receptor induced fit effects. J Med Chem 49:534–553
Acknowledgments
The authors would like to thank Dr. Judith Hobrath for her technical assistance.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media,LLC
About this protocol
Cite this protocol
Dolan, M.A., Noah, J.W., Hurt, D. (2011). Comparison of Common Homology Modeling Algorithms: Application of User-Defined Alignments. In: Orry, A., Abagyan, R. (eds) Homology Modeling. Methods in Molecular Biology, vol 857. Humana Press. https://doi.org/10.1007/978-1-61779-588-6_18
Download citation
DOI: https://doi.org/10.1007/978-1-61779-588-6_18
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-61779-587-9
Online ISBN: 978-1-61779-588-6
eBook Packages: Springer Protocols