Improving protein secondary structure prediction with aligned homologous sequences
- 1 January 1996
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 5 (1) , 106-113
- https://doi.org/10.1002/pro.5560050113
Abstract
Most recent protein secondary structure prediction methods use sequence alignments to improve the prediction quality. We investigate the relationship between the location of secondary structural elements, gaps, and variable residue positions in multiple sequence alignments. We further investigate how these relationships compare with those found in structurally aligned protein families. We show how such associations may be used to improve the quality of prediction of the secondary structure elements, using the Quadratic‐Logistic method with profiles. Furthermore, we analyze the extent to which the number of homologous sequences influences the quality of prediction. The analysis of variable residue positions shows that surprisingly, helical regions exhibit greater variability than do coil regions, which are generally thought to be the most common secondary structure elements in loops. However, the correlation between variability and the presence of helices does not significantly improve prediction quality. Gaps are a distinct signal for coil regions. Increasing the coil propensity for those residues occurring in gap regions enhances the overall prediction quality. Prediction accuracy increases initially with the number of homologues, but changes negligibly as the number of homologues exceeds about 14. The alignment quality affects the prediction more than other factors, hence a careful selection and alignment of even a small number of homologues can lead to significant improvements in prediction accuracy.Keywords
This publication has 31 references indexed in Scilit:
- Prediction of Protein Secondary Structure by Combining Nearest-neighbor Algorithms and Multiple Sequence AlignmentsJournal of Molecular Biology, 1995
- Prediction of Protein Secondary Structure at Better than 70% AccuracyJournal of Molecular Biology, 1993
- Predicting protein secondary structure with a nearest-neighbor algorithmJournal of Molecular Biology, 1992
- Predicting protein secondary structure using neural net and statistical methodsJournal of Molecular Biology, 1992
- Machine learning approach for the prediction of protein secondary structureJournal of Molecular Biology, 1990
- Improvements in a secondary structure prediction method based on a search for local sequence homologies and its use as a model building toolBiochimica et Biophysica Acta (BBA) - Protein Structure and Molecular Enzymology, 1988
- Predicting the secondary structure of globular proteins using neural network modelsJournal of Molecular Biology, 1988
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Comparative model-building of the mammalian serine proteasesJournal of Molecular Biology, 1981
- Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteinsJournal of Molecular Biology, 1978