Assessing model accuracy using the homology modeling automatically software
- 19 July 2007
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 70 (1) , 105-118
- https://doi.org/10.1002/prot.21466
Abstract
Homology modeling is a powerful technique that greatly increases the value of experimental structure determination by using the structural information of one protein to predict the structures of homologous proteins. We have previously described a method of homology modeling by satisfaction of spatial restraints (Li et al., Protein Sci 1997;6:956–970). The Homology Modeling Automatically (HOMA) web site, , is a new tool, using this method to predict 3D structure of a target protein based on the sequence alignment of the target protein to a template protein and the structure coordinates of the template. The user is presented with the resulting models, together with an extensive structure validation report providing critical assessments of the quality of the resulting homology models. The homology modeling method employed by HOMA was assessed and validated using twenty‐four groups of homologous proteins. Using HOMA, homology models were generated for 510 proteins, including 264 proteins modeled with correct folds and 246 modeled with incorrect folds. Accuracies of these models were assessed by superimposition on the corresponding experimentally determined structures. A subset of these results was compared with parallel studies of modeling accuracy using several other automated homology modeling approaches. Overall, HOMA provides prediction accuracies similar to other state‐of‐the‐art homology modeling methods. We also provide an evaluation of several structure quality validation tools in assessing the accuracy of homology models generated with HOMA. This study demonstrates that Verify3D (Luthy et al., Nature 1992;356:83–85) and ProsaII (Sippl, Proteins 1993;17:355–362) are most sensitive in distinguishing between homology models with correct or incorrect folds. For homology models that have the correct fold, the steric conformational energy (including primarily the Van der Waals energy), MolProbity clashscore (Word et al., Protein Sci 2000;9:2251–2259), and the PROCHECK G‐factors (Laskowski et al., J Biomol NMR 1996;8:477–486) provide sensitive and consistent methods for assessing accuracy and can distinguish between homology models of higher and lower accuracy. As demonstrated in the accompanying paper (Bhattacharya et al., accompanying paper), combinations of these scores for models generated with HOMA provide a basis for distinguishing low from high accuracy models. Proteins 2008.Keywords
This publication has 53 references indexed in Scilit:
- Evaluating protein structures determined by structural genomics consortiaProteins-Structure Function and Bioinformatics, 2007
- A comparative study of available software for high‐accuracy homology modeling: From sequence alignments to structural modelsProtein Science, 2006
- SWISS-MODEL: an automated protein homology-modeling serverNucleic Acids Research, 2003
- Statistical potentials for fold assessmentProtein Science, 2002
- Modeling of loops in protein structuresProtein Science, 2000
- AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMRJournal of Biomolecular NMR, 1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Comparative Protein Modelling by Satisfaction of Spatial RestraintsJournal of Molecular Biology, 1993
- Assessment of protein models with three-dimensional profilesNature, 1992
- Free R value: a novel statistical quantity for assessing the accuracy of crystal structuresNature, 1992