Improving Prediction of Protein Secondary Structure Using Structured Neural Networks and Multiple Sequence Alignments

1 January 1996

journal article
research article
Published by Mary Ann Liebert Inc in Journal of Computational Biology

Vol. 3 (1) , 163-183
https://doi.org/10.1089/cmb.1996.3.163

Abstract

The prediction of protein secondary structure by use of carefully structured neural networks and multiple sequence alignments has been investigated. Separate networks are used for predicting the three secondary structures α-helix, β-strand, and coil. The networks are designed using a priori knowledge of amino acid properties with respect to the secondary structure and the characteristic periodicity in α-helices. Since these single-structure networks all have less than 600 adjustable weights, overfitting is avoided. To obtain a three-state prediction of α-helix, β-strand, or coil, ensembles of single-structure networks are combined with another neural network. This method gives an overall prediction accuracy of 66.3% when using 7-fold cross-validation on a database of 126 nonhomologous globular proteins. Applying the method to multiple sequence alignments of homologous proteins increases the prediction accuracy significantly to 71.3% with corresponding Matthew's correlation coefficients C_α = 0.59, C_β = 0.52, and C_c = 0.50. More than 72% of the residues in the database are predicted with an accuracy of 80%. It is shown that the network outputs can be interpreted as estimated probabilities of correct prediction, and, therefore, these numbers indicate which residues are predicted with high confidence.

Keywords

This publication has 32 references indexed in Scilit:

Position-based sequence weights
Published by Elsevier ,2004
Volume changes in protein evolution
Journal of Molecular Biology, 1994
Limits on α‐helix prediction with neural network models
Proteins-Structure Function and Bioinformatics, 1992
Neural network ensembles
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1990
Invited review combining forecasts—twenty years later
Journal of Forecasting, 1989
Weights for data related by a tree
Journal of Molecular Biology, 1989
Protein secondary structure and homology by neural networks The α‐helices in rhodopsin
FEBS Letters, 1988
Secondary structure prediction: combination of three different methods
Protein Engineering, Design and Selection, 1988
Further developments of protein secondary structure prediction using information theory
Journal of Molecular Biology, 1987
Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins
Journal of Molecular Biology, 1978