Improving the prediction accuracy of residue solvent accessibility and real‐value backbone torsion angles of proteins by guided‐learning through a two‐layer neural network
- 14 August 2008
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 74 (4) , 847-856
- https://doi.org/10.1002/prot.22193
Abstract
This article attempts to increase the prediction accuracy of residue solvent accessibility and real‐value backbone torsion angles of proteins through improved learning. Most methods developed for improving the backpropagation algorithm of artificial neural networks are limited to small neural networks. Here, we introduce a guided‐learning method suitable for networks of any size. The method employs a part of the weights for guiding and the other part for training and optimization. We demonstrate this technique by predicting residue solvent accessibility and real‐value backbone torsion angles of proteins. In this application, the guiding factor is designed to satisfy the intuitive condition that for most residues, the contribution of a residue to the structural properties of another residue is smaller for greater separation in the protein‐sequence distance between the two residues. We show that the guided‐learning method makes a 2–4% reduction in 10‐fold cross‐validated mean absolute errors (MAE) for predicting residue solvent accessibility and backbone torsion angles, regardless of the size of database, the number of hidden layers and the size of input windows. This together with introduction of two‐layer neural network with a bipolar activation function leads to a new method that has a MAE of 0.11 for residue solvent accessibility, 36° for ψ, and 22° for ϕ. The method is available as a Real‐SPINE 3.0 server inhttp://sparks.informatics.iupui.edu. Proteins 2009.Keywords
This publication has 64 references indexed in Scilit:
- SP5: Improving Protein Fold Recognition by Using Torsion Angle Profiles and Profile-Based Gap Penalty ModelPLOS ONE, 2008
- MUSTER: Improving protein sequence profile–profile alignments by using multiple sources of structure informationProteins-Structure Function and Bioinformatics, 2008
- Improved generalization of neural classifiers with enforced internal representationNeurocomputing, 2007
- Real‐SPINE: An integrated system of neural networks for real‐value prediction of protein structural propertiesProteins-Structure Function and Bioinformatics, 2007
- A new interactive model for improving the learning performance of back propagation neural networkAutomation in Construction, 2007
- Achieving 80% ten‐fold cross‐validated accuracy for secondary structure prediction by large‐scale trainingProteins-Structure Function and Bioinformatics, 2007
- Phase space learning in an autonomous dynamical neural networkNeurocomputing, 2006
- A Decision-Theoretic Generalization of On-Line Learning and an Application to BoostingJournal of Computer and System Sciences, 1997
- A learning algorithm for multilayered neural networks based on linear least squares problemsNeural Networks, 1993
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983