PROFcon: novel prediction of long-range contacts
Open Access
- 12 May 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (13) , 2960-2968
- https://doi.org/10.1093/bioinformatics/bti454
Abstract
Motivation: Despite the continuing advance in the experimental determination of protein structures, the gap between the number of known protein sequences and structures continues to increase. Prediction methods can bridge this sequence–structure gap only partially. Better predictions of non-local contacts between residues could improve comparative modeling, fold recognition and could assist in the experimental structure determination. Results: Here, we introduced PROFcon, a novel contact prediction method that combines information from alignments, from predictions of secondary structure and solvent accessibility, from the region between two residues and from the average properties of the entire protein. In contrast to some other methods, PROFcon predicted short and long proteins at similar levels of accuracy. As expected, PROFcon was clearly less accurate when tested on sparse evolutionary profiles, that is, on families with few homologs. Prediction accuracy was highest for proteins belonging to the SCOP alpha/beta class. PROFcon compared favorably with state-of-the-art prediction methods at the CASP6 meeting. While the performance may still be perceived as low, our method clearly pushed the mark higher. Furthermore, predictions are already accurate enough to seed predictions of global features of protein structure. Availability:http://www.predictprotein.org/submit_profcon.html Contact:punta@cubic.bioc.columbia.edu Supplementary information:http://www.rostlab.org/results/2005/profconKeywords
This publication has 48 references indexed in Scilit:
- SCOP database in 2004: refinements integrate structure and sequence family dataNucleic Acids Research, 2004
- Common intervals and sorting by reversals: a marriage of necessityBioinformatics, 2002
- The Protein Data BankActa Crystallographica Section D-Biological Crystallography, 2002
- EVA: continuous automatic evaluation of protein structure prediction serversBioinformatics, 2001
- Prediction of contact maps with neural networks and correlated mutationsProtein Engineering, Design and Selection, 2001
- CAFASP2: The second critical assessment of fully automated structure prediction methodsProteins-Structure Function and Bioinformatics, 2001
- A neural network based predictor of residue contacts in proteinsProtein Engineering, Design and Selection, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Protein secondary structure and homology by neural networks The α‐helices in rhodopsinFEBS Letters, 1988
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977