Improving Prediction of Protein Secondary Structure Using Structured Neural Networks and Multiple Sequence Alignments
- 1 January 1996
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 3 (1) , 163-183
- https://doi.org/10.1089/cmb.1996.3.163
Abstract
The prediction of protein secondary structure by use of carefully structured neural networks and multiple sequence alignments has been investigated. Separate networks are used for predicting the three secondary structures α-helix, β-strand, and coil. The networks are designed using a priori knowledge of amino acid properties with respect to the secondary structure and the characteristic periodicity in α-helices. Since these single-structure networks all have less than 600 adjustable weights, overfitting is avoided. To obtain a three-state prediction of α-helix, β-strand, or coil, ensembles of single-structure networks are combined with another neural network. This method gives an overall prediction accuracy of 66.3% when using 7-fold cross-validation on a database of 126 nonhomologous globular proteins. Applying the method to multiple sequence alignments of homologous proteins increases the prediction accuracy significantly to 71.3% with corresponding Matthew's correlation coefficients Cα = 0.59, Cβ = 0.52, and Cc = 0.50. More than 72% of the residues in the database are predicted with an accuracy of 80%. It is shown that the network outputs can be interpreted as estimated probabilities of correct prediction, and, therefore, these numbers indicate which residues are predicted with high confidence.Keywords
This publication has 32 references indexed in Scilit:
- Position-based sequence weightsPublished by Elsevier ,2004
- Volume changes in protein evolutionJournal of Molecular Biology, 1994
- Limits on α‐helix prediction with neural network modelsProteins-Structure Function and Bioinformatics, 1992
- Neural network ensemblesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1990
- Invited review combining forecasts—twenty years laterJournal of Forecasting, 1989
- Weights for data related by a treeJournal of Molecular Biology, 1989
- Protein secondary structure and homology by neural networks The α‐helices in rhodopsinFEBS Letters, 1988
- Secondary structure prediction: combination of three different methodsProtein Engineering, Design and Selection, 1988
- Further developments of protein secondary structure prediction using information theoryJournal of Molecular Biology, 1987
- Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteinsJournal of Molecular Biology, 1978