Protein contact prediction using patterns of correlation
- 14 May 2004
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 56 (4) , 679-684
- https://doi.org/10.1002/prot.20160
Abstract
We describe a new method for using neural networks to predict residue contact pairs in a protein. The main inputs to the neural network are a set of 25 measures of correlated mutation between all pairs of residues in two “windows” of size 5 centered on the residues of interest. While the individual pair‐wise correlations are a relatively weak predictor of contact, by training the network on windows of correlation the accuracy of prediction is significantly improved. The neural network is trained on a set of 100 proteins and then tested on a disjoint set of 1033 proteins of known structure. An average predictive accuracy of 21.7% is obtained taking the best L/2 predictions for each protein, where L is the sequence length. Taking the best L/10 predictions gives an average accuracy of 30.7%. The predictor is also tested on a set of 59 proteins from the CASP5 experiment. The accuracy is found to be relatively consistent across different sequence lengths, but to vary widely according to the secondary structure. Predictive accuracy is also found to improve by using multiple sequence alignments containing many sequences to calculate the correlations. Proteins 2004.Keywords
This publication has 17 references indexed in Scilit:
- EVA: evaluation of protein structure prediction serversNucleic Acids Research, 2003
- Prediction of protein residue contacts with a PDB-derived likelihood matrixProtein Engineering, Design and Selection, 2002
- Prediction of contact maps with neural networks and correlated mutationsProtein Engineering, Design and Selection, 2001
- Progress in predicting inter-residue contacts of proteins with neural networks and correlated mutationsProteins-Structure Function and Bioinformatics, 2001
- The PSIPRED protein structure prediction serverBioinformatics, 2000
- Protein secondary structure prediction based on position-specific scoring matrices 1 1Edited by G. Von HeijneJournal of Molecular Biology, 1999
- A neural network based predictor of residue contacts in proteinsProtein Engineering, Design and Selection, 1999
- Improving contact predictions by the combination of correlated mutations and other sources of sequence informationFolding and Design, 1997
- Protein fold recognition and dynamics in the space of contact mapsProteins-Structure Function and Bioinformatics, 1996
- Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c551Journal of Molecular Biology, 1971