A simple and fast approach to prediction of protein secondary structure from multiply aligned sequences with accuracy above 70%
- 31 December 1995
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 4 (12), 2517-2525
- https://doi.org/10.1002/pro.5560041208
Abstract
To improve secondary structure predictions in protein sequences, the information residing in multiple sequence alignments of substituted but structurally related proteins is exploited. A database comprised of 70 protein families and a total of 2,500 sequences, some of which were aligned by tertiary structural superpositions, was used to calculate residue exchange weight matrices within α‐helical, β‐strand, and coil substructures, respectively. Secondary structure predictions were made based on the observed residue substitutions in local regions of the multiple alignments and the largest possible associated exchange weights in each of the three matrix types. Comparison of the observed and predicted secondary structure on a per‐residue basis yielded a mean accuracy of 72.2%. Individual α‐helix, β‐strand, and coil states were respectively predicted at 66.4, 66.7, and 75.8% correctness, representing a well‐balanced three‐state prediction. The accuracy level, verified by cross‐validation through jack‐knife tests on all protein families, dropped, on average, to only 70.9%, indicating the rigor of the prediction procedure. On the basis of robustness, conceptual clarity, accuracy, and executable efficiency, the method has considerable advantage, especially with its sole reliance on amino acid substitutions within structurally related proteins.
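The abstract gives only the outline of the procedure, so the following is a minimal sketch of the general idea rather than the authors' implementation. It assumes one exchange weight matrix per state (helix `H`, strand `E`, coil `C`), scores each alignment column by the largest exchange weight among the residue pairs observed in it, sums those scores over a small window, and assigns the state with the highest total. The function names, the window size, the handling of gaps and unsubstituted columns, and the toy weights are all hypothetical; the per-residue accuracy function simply counts matching positions.

```python
from itertools import combinations

STATES = ("H", "E", "C")  # helix, strand, coil


def column_score(column, weights):
    """Largest exchange weight among residue pairs observed in one column.

    `weights` maps residue pairs such as ("A", "L") to a weight. Gaps ("-")
    are ignored and unknown pairs score 0.0 (both are assumptions).
    """
    residues = sorted(set(column) - {"-"})
    if len(residues) < 2:
        # No substitution observed: fall back to the self-exchange weight.
        return max((weights.get((r, r), 0.0) for r in residues), default=0.0)
    return max(
        weights.get((a, b), weights.get((b, a), 0.0))
        for a, b in combinations(residues, 2)
    )


def predict(alignment, matrices, half_window=2):
    """Assign one of H/E/C to every column of an equal-length alignment."""
    length = len(alignment[0])
    columns = ["".join(seq[i] for seq in alignment) for i in range(length)]
    per_column = [
        {s: column_score(col, matrices[s]) for s in STATES} for col in columns
    ]
    prediction = []
    for i in range(length):
        lo, hi = max(0, i - half_window), min(length, i + half_window + 1)
        totals = {s: sum(per_column[j][s] for j in range(lo, hi)) for s in STATES}
        prediction.append(max(STATES, key=lambda s: totals[s]))
    return "".join(prediction)


def per_residue_accuracy(observed, predicted):
    """Percentage of positions whose predicted state matches the observed one."""
    matches = sum(o == p for o, p in zip(observed, predicted))
    return 100.0 * matches / len(observed)


# Toy weights, invented for illustration only.
toy_matrices = {
    "H": {("A", "L"): 2.0, ("I", "V"): 0.5},
    "E": {("A", "L"): 0.5, ("I", "V"): 2.0},
    "C": {},
}
toy_alignment = ["AAVV", "LLII"]
predicted = predict(toy_alignment, toy_matrices, half_window=1)
print(predicted)                                # HHEE
print(per_residue_accuracy("HHEC", predicted))  # 75.0
```

A jack-knife test in this setting would recompute the three matrices with one protein family left out, predict that family, and average the resulting accuracies over all families; the sketch above covers only the prediction and scoring steps.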