Predicting residue solvent accessibility from protein sequence by considering the sequence environment.
Open Access
- 1 September 2000
- journal article
- research article
- Published by Oxford University Press (OUP) in Protein Engineering, Design and Selection
- Vol. 13 (9) , 607-609
- https://doi.org/10.1093/protein/13.9.607
Abstract
The solvent accessibility of each residue is predicted on the basis of the protein sequence. A set of 338 monomeric, non-homologous and high-resolution protein crystal structures is used as a learning set and a jackknife procedure is applied to each entry. The prediction is based on the comparison of the observed and the average values of the solvent-accessible area. It appears that the prediction accuracy is significantly improved by considering the residue types preceding and/or following the residue whose accessibility must be predicted. In contrast, the separate treatment of different secondary structural types does not improve the quality of the prediction. It is furthermore shown that the residue accessibility is much better predicted in small than in larger proteins. Such a discrepancy must be carefully considered in any algorithm for predicting residue accessibility.Keywords
This publication has 13 references indexed in Scilit:
- The bottom line for prediction of residue solvent accessibility.Protein Engineering, Design and Selection, 1999
- Easy method to predict solvent accessibility from multiple protein sequence alignmentsProteins-Structure Function and Bioinformatics, 1998
- Cystine knotsCurrent Opinion in Structural Biology, 1995
- Increasing thermal stability of subtilisin from mutations suggested by strongly interacting side-chain clustersProtein Engineering, Design and Selection, 1995
- Enlarged representative set of protein structuresProtein Science, 1994
- Predicting surface exposure of amino acids from protein sequenceProtein Engineering, Design and Selection, 1990
- Flexibility plot of proteinsProtein Engineering, Design and Selection, 1989
- Prediction of chain flexibility in proteinsThe Science of Nature, 1985
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977