Improved prediction of protein secondary structure by use of sequence profiles and neural networks.
- 15 August 1993
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 90 (16) , 7558-7562
- https://doi.org/10.1073/pnas.90.16.7558
Abstract
The explosive accumulation of protein sequences in the wake of large-scale sequencing projects is in stark contrast to the much slower experimental determination of protein structures. Improved methods of structure prediction from the gene sequence alone are therefore needed. Here, we report a substantial increase in both the accuracy and quality of secondary-structure predictions, using a neural-network algorithm. The main improvements come from the use of multiple sequence alignments (better overall accuracy), from ''balanced training'' (better prediction of beta-strands), and from ''structure context training'' (better prediction of helix and strand lengths). This method, cross-validated on seven different test sets purged of sequence similarity to learning sets, achieves a three-state prediction accuracy of 69.7%, significantly better than previous methods. In addition, the predicted structures have a more realistic distribution of helix and strand segments. The predictions may be suitable for use in practice as a first estimate of the structural type of newly sequenced proteins.Keywords
This publication has 26 references indexed in Scilit:
- Prediction of protein secondary structure and active sites using the alignment of homologous sequencesPublished by Elsevier ,2004
- Predicting protein secondary structure with a nearest-neighbor algorithmJournal of Molecular Biology, 1992
- Hybrid system for protein secondary structure predictionJournal of Molecular Biology, 1992
- Conservation analysis and structure prediction of the SH2 family of phosphotyrosine binding domainsFEBS Letters, 1992
- Predicting protein secondary structure using neural net and statistical methodsJournal of Molecular Biology, 1992
- Selection of representative protein data setsProtein Science, 1992
- Improvements in protein secondary structure prediction by an enhanced neural networkJournal of Molecular Biology, 1990
- Improvements in a secondary structure prediction method based on a search for local sequence homologies and its use as a model building toolBiochimica et Biophysica Acta (BBA) - Protein Structure and Molecular Enzymology, 1988
- Predicting the secondary structure of globular proteins using neural network modelsJournal of Molecular Biology, 1988
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983