Improvement of protein secondary structure prediction using binary word encoding
- 1 January 1997
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 27 (1) , 36-46
- https://doi.org/10.1002/(sici)1097-0134(199701)27:1<36::aid-prot5>3.0.co;2-l
Abstract
We propose a binary word encoding to improve the protein secondary structure prediction. A binary word encoding encodes a local amino acid sequence to a binary word, which consists of 0 or 1. We use an encoding function to map an amino acid to 0 or 1. Using the binary word encoding, we can statistically extract the multiresidue information, which depends on more than one residue. We combine the binary word encoding with the GOR method, its modified version, which shows better accuracy, and the neural network method. The binary word encoding improves the accuracy of GOR by 2.8%. We obtain similar improvement when we combine this with the modified GOR method and the neural network method. When we use multiple sequence alignment data, the binary word encoding similarly improves the accuracy. The accuracy of our best combined method is 68.2%. In this paper, we only show improvement of the GOR and neural network method, we cannot say that the encoding improves the other methods. But the improvement by the encoding suggests that the multiresidue interaction affects the formation of secondary structure. In addition, we find that the optimal encoding function obtained by the simulated annealing method relates to non-polarity. This means that nonpolarity is important to the multiresidue interaction. Proteins 27:36–46Keywords
This publication has 21 references indexed in Scilit:
- Protein Secondary Structure Prediction Using Nearest-neighbor MethodsJournal of Molecular Biology, 1993
- Prediction of Protein Secondary Structure at Better than 70% AccuracyJournal of Molecular Biology, 1993
- Protein secondary structure prediction with a neural network.Proceedings of the National Academy of Sciences, 1989
- Predicting the secondary structure of globular proteins using neural network modelsJournal of Molecular Biology, 1988
- Further developments of protein secondary structure prediction using information theoryJournal of Molecular Biology, 1987
- Amino acid sequence homology applied to the prediction of protein secondary structures, and joint prediction with existing methodsBiochimica et Biophysica Acta (BBA) - Protein Structure and Molecular Enzymology, 1986
- Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteinsJournal of Molecular Biology, 1978
- Triplet information in helix prediction applied to the analysis of super-secondary structuresJournal of Molecular Biology, 1977
- Prediction of protein conformationBiochemistry, 1974
- Conformational parameters for amino acids in helical, β-sheet, and random coil regions calculated from proteinsBiochemistry, 1974