Evaluating protein structures determined by structural genomics consortia
Top Cited Papers
- 13 February 2007
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 66 (4) , 778-795
- https://doi.org/10.1002/prot.21165
Abstract
Structural genomics projects are providing large quantities of new 3D structural data for proteins. To monitor the quality of these data, we have developed the protein structure validation software suite (PSVS), for assessment of protein structures generated by NMR or X‐ray crystallographic methods. PSVS is broadly applicable for structure quality assessment in structural biology projects. The software integrates under a single interface analyses from several widely‐used structure quality evaluation tools, including PROCHECK (Laskowski et al., J Appl Crystallog 1993;26:283–291), MolProbity (Lovell et al., Proteins 2003;50:437–450), Verify3D (Luthy et al., Nature 1992;356:83–85), ProsaII (Sippl, Proteins 1993;17: 355–362), the PDB validation software, and various structure‐validation tools developed in our own laboratory. PSVS provides standard constraint analyses, statistics on goodness‐of‐fit between structures and experimental data, and knowledge‐based structure quality scores in standardized format suitable for database integration. The analysis provides both global and site‐specific measures of protein structure quality. Global quality measures are reported as Z scores, based on calibration with a set of high‐resolution X‐ray crystal structures. PSVS is particularly useful in assessing protein structures determined by NMR methods, but is also valuable for assessing X‐ray crystal structures or homology models. Using these tools, we assessed protein structures generated by the Northeast Structural Genomics Consortium and other international structural genomics projects, over a 5‐year period. Protein structures produced from structural genomics projects exhibit quality score distributions similar to those of structures produced in traditional structural biology projects during the same time period. However, while some NMR structures have structure quality scores similar to those seen in higher‐resolution X‐ray crystal structures, the majority of NMR structures have lower scores. Potential reasons for this “structure quality score gap” between NMR and X‐ray crystal structures are discussed. Proteins 2007.Keywords
This publication has 50 references indexed in Scilit:
- Assessing precision and accuracy of protein structures derived from NMR dataProteins-Structure Function and Bioinformatics, 2005
- Structure validation by Cα geometry: ϕ,ψ and Cβ deviationProteins-Structure Function and Bioinformatics, 2003
- The Protein Data BankNucleic Acids Research, 2000
- Structural genomics: keystone for a Human Proteome Project.Nature Structural & Molecular Biology, 1999
- AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMRJournal of Biomolecular NMR, 1996
- Recognition of errors in three‐dimensional structures of proteinsProteins-Structure Function and Bioinformatics, 1993
- PROCHECK: a program to check the stereochemical quality of protein structuresJournal of Applied Crystallography, 1993
- Assessment of protein models with three-dimensional profilesNature, 1992
- Free R value: a novel statistical quantity for assessing the accuracy of crystal structuresNature, 1992
- WHAT IF: A molecular modeling and drug design programJournal of Molecular Graphics, 1990