Prediction of protein surface accessibility with information theory
- 9 January 2001
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 42 (4) , 452-459
- https://doi.org/10.1002/1097-0134(20010301)42:4<452::aid-prot40>3.0.co;2-q
Abstract
A new, simple method based on information theory is introduced to predict the solvent accessibility of amino acid residues in various states defined by their different thresholds. Prediction is achieved by the application of information obtained from a single amino acid position or pair-information for a window of seventeen amino acids around the desired residue. Results obtained by pairwise information values are better than results from single amino acids. This reinforces the effect of the local environment on the accessibility of amino acid residues. The prediction accuracy of this method in a jackknife test system for two and three states is better than 70 and 60%, respectively. A comparison of the results with those reported by others involving the same data set also testifies to a better prediction accuracy in our case. Proteins 2001;42:452–459.Keywords
This publication has 42 references indexed in Scilit:
- Environment and exposure to solvent of protein atoms. Lysozyme and insulinPublished by Elsevier ,2004
- Use of Amino Acid Environment-dependent Substitution Tables and Conformational Propensities in Structure Prediction from Aligned Sequences of Homologous Proteins I. Solvent Accessibility ClassesJournal of Molecular Biology, 1994
- Enlarged representative set of protein structuresProtein Science, 1994
- Redefining the goals of protein secondary structure predictionJournal of Molecular Biology, 1994
- Solvent accessible surface area and excluded volume in proteinsJournal of Molecular Biology, 1984
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Prediction of protein antigenic determinants from amino acid sequences.Proceedings of the National Academy of Sciences, 1981
- Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteinsJournal of Molecular Biology, 1978
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977
- The nature of the accessible and buried surfaces in proteinsJournal of Molecular Biology, 1976