Improving sequence-based fold recognition by using 3D model quality assessment
Open Access
- 14 June 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (17) , 3509-3515
- https://doi.org/10.1093/bioinformatics/bti540
Abstract
Motivation: The ability of a simple method (MODCHECK) to determine the sequence–structure compatibility of a set of structural models generated by fold recognition is tested in a thorough benchmark analysis. Four Model Quality Assessment Programs (MQAPs) were tested on 188 targets from the latest LiveBench-9 automated structure evaluation experiment. We systematically test and evaluate whether the MQAP methods can successfully detect native-likemodels. Results: We show that compared with the other three methods tested MODCHECK is the most reliable method for consistently performing the best top model selection and for ranking the models. In addition, we show that the choice of model similarity score used to assess a model's similarity to the experimental structure can influence the overall performance of these tools. Although these MQAP methods fail to improve the model selection performance for methods that already incorporate protein three dimension (3D) structural information, an improvement is observed for methods that are purely sequence-based, including the best profile–profile methods. This suggests that even the best sequence-based fold recognition methods can still be improved by taking into account the 3D structural information. Contact:d.jones@cs.ucl.ac.ukKeywords
This publication has 22 references indexed in Scilit:
- ORFeus: detection of distant homology using sequence profiles and predicted secondary structureNucleic Acids Research, 2003
- mRNA:guanine-N 7 cap methyltransferases: identification of novel members of the family, evolutionary analysis, homology modeling, and analysis of sequence-structure-function relationshipsBMC Bioinformatics, 2001
- LiveBench‐1: Continuous benchmarking of protein structure prediction serversProtein Science, 2001
- Improving the quality of twilight‐zone alignmentsProtein Science, 2000
- GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequencesJournal of Molecular Biology, 1999
- Hidden Markov models for detecting remote protein homologies.Bioinformatics, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- A new approach to protein fold recognitionNature, 1992
- Evaluation of protein models by atomic solvation preferenceJournal of Molecular Biology, 1992
- Identification of native protein folds amongst a large number of incorrect modelsJournal of Molecular Biology, 1990