Prediction of disordered regions in proteins from position specific score matrices
- 21 October 2003
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 53 (S6), 573-578
- https://doi.org/10.1002/prot.10528
Abstract
We describe here the results of using a neural-network-based method (DISOPRED) for predicting disordered regions in 55 proteins in the 5th CASP experiment. A set of 715 highly resolved proteins with regions of disorder was used to train the network. The inputs to the network were derived from sequence profiles generated by PSI‐BLAST. A post‐filter was applied to the output of the network to prevent residues from being predicted as disordered within regions of confidently predicted alpha helix or beta sheet structure. The overall two‐state prediction accuracy for the method is very high (90%), but this figure is highly skewed by the fact that most residues are observed to be ordered. The overall Matthews' correlation coefficient for the submitted predictions is 0.34, which gives a more realistic impression of the overall accuracy of the method, though it still indicates significant predictive power. Proteins 2003;53:573–578.
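The contrast the abstract draws between two-state accuracy and the Matthews' correlation coefficient can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function name `two_state_scores`, the `D`/`O` (disordered/ordered) labelling, and the toy 100-residue data are hypothetical and are not taken from the paper or from the CASP5 targets.

```python
import math


def two_state_scores(observed, predicted):
    """Return (accuracy, mcc) for two equal-length label strings.

    Labels are 'D' (disordered) and 'O' (ordered); this encoding is an
    illustrative assumption, not the format used by DISOPRED.
    """
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")

    tp = tn = fp = fn = 0
    for obs, pred in zip(observed, predicted):
        if obs == "D" and pred == "D":
            tp += 1  # disordered residue correctly predicted
        elif obs == "O" and pred == "O":
            tn += 1  # ordered residue correctly predicted
        elif obs == "O" and pred == "D":
            fp += 1  # ordered residue wrongly called disordered
        else:
            fn += 1  # disordered residue missed

    accuracy = (tp + tn) / len(observed)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention: report 0.0 when the coefficient is undefined.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, mcc


# Toy data: 100 residues, of which only 10 are disordered.
observed = "D" * 10 + "O" * 90

# A trivial predictor that calls every residue ordered.
all_ordered = "O" * 100
acc_trivial, mcc_trivial = two_state_scores(observed, all_ordered)

# A predictor that finds 4 of the 10 disordered residues
# and makes 4 false-positive calls.
partial = "D" * 4 + "O" * 6 + "D" * 4 + "O" * 86
acc_partial, mcc_partial = two_state_scores(observed, partial)

print(f"trivial: accuracy={acc_trivial:.2f}, MCC={mcc_trivial:.2f}")
print(f"partial: accuracy={acc_partial:.2f}, MCC={mcc_partial:.2f}")
```

On this toy data both predictors reach 90% two-state accuracy, yet the trivial one has a correlation coefficient of 0 while the partial one scores roughly 0.39. This is why a correlation coefficient is more informative than raw accuracy when most residues are ordered.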