Model quality assessment using distance constraints from alignments
- 10 September 2008
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 75 (3) , 540-549
- https://doi.org/10.1002/prot.22262
Abstract
Given a set of alternative models for a specific protein sequence, the model quality assessment (MQA) problem asks for an assignment of scores to each model in the set. A good MQA program assigns these scores such that they correlate well with real quality of the models, ideally scoring best that model which is closest to the true structure. In this article, we present a new approach for addressing the MQA problem. It is based on distance constraints extracted from alignments to templates of known structure, and is implemented in the Undertaker program for protein structure prediction. One novel feature is that we extract noncontact constraints as well as contact constraints. We describe how the distance constraint extraction is done and we show how they can be used to address the MQA problem. We have compared our method on CASP7 targets and the results show that our method is at least comparable with the best MQA methods that were assessed at CASP7. We also propose a new evaluation measure, Kendall's τ, that is more interpretable than conventional measures used for evaluating MQA methods (Pearson's r and Spearman's ρ). We show clear examples where Kendall's τ agrees much more with our intuition of a correct MQA, and we therefore propose that Kendall's τ be used for future CASP MQA assessments. Proteins 2009.Keywords
This publication has 23 references indexed in Scilit:
- Applying undertaker cost functions to model quality assessmentProteins-Structure Function and Bioinformatics, 2008
- Predict-2nd: a tool for generalized protein local structure predictionBioinformatics, 2008
- Benchmarking consensus model quality assessment for protein fold recognitionBMC Bioinformatics, 2007
- Assessment of predictions in the model quality assessment categoryProteins-Structure Function and Bioinformatics, 2007
- SAM-T04: What is new in protein-structure prediction for CASP6Proteins-Structure Function and Bioinformatics, 2005
- A graph‐theory algorithm for rapid protein side‐chain predictionProtein Science, 2003
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Comparative Protein Modelling by Satisfaction of Spatial RestraintsJournal of Molecular Biology, 1993
- Basic local alignment search toolJournal of Molecular Biology, 1990
- A Computer Method for Calculating Kendall's Tau with Ungrouped DataJournal of the American Statistical Association, 1966