Real value prediction of solvent accessibility from amino acid sequence
- 6 February 2003
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 50 (4) , 629-635
- https://doi.org/10.1002/prot.10328
Abstract
The solvent accessibility of amino acid residues has been predicted in the past by classifying them into exposure states with varying thresholds. This classification provides a wide range of values for the accessible surface area (ASA) within which a residue may fall. Thus far, no attempt has been made to predict real values of ASA from the sequence information without a priori classification into exposure states. Here, we present a new method with which to predict real value ASAs for residues, based on neighborhood information. Our real value prediction neural network could estimate the ASA for four different nonhomologous, nonredundant data sets of varying size, with 18.0–19.5% mean absolute error, defined as per residue absolute difference between the predicted and experimental values of relative ASA. Correlation between the predicted and experimental values ranged from 0.47 to 0.50. It was observed that the ASA of a residue could be predicted within a 23.7% mean absolute error, even when no information about its neighbors is included. Prediction of real values answers the issue of arbitrary choice of ASA state thresholds, and carries more information than category prediction. Prediction error for each residue type strongly correlates with the variability in its experimental ASA values. Proteins 2003;50:629–635.Keywords
This publication has 22 references indexed in Scilit:
- NETASA: neural network based prediction of solvent accessibilityBioinformatics, 2002
- Prediction of coordination number and relative solvent accessibility in proteinsProteins-Structure Function and Bioinformatics, 2002
- Protein threading by learningProceedings of the National Academy of Sciences, 2001
- Prediction of protein surface accessibility with information theoryProteins-Structure Function and Bioinformatics, 2001
- Predicting residue solvent accessibility from protein sequence by considering the sequence environment.Protein Engineering, Design and Selection, 2000
- Application of multiple sequence alignment profiles to improve protein secondary structure predictionProteins-Structure Function and Bioinformatics, 2000
- Protein secondary structure prediction based on position-specific scoring matrices 1 1Edited by G. Von HeijneJournal of Molecular Biology, 1999
- Protein Structure PredictionScience, 1996
- Improved prediction of protein secondary structure by use of sequence profiles and neural networks.Proceedings of the National Academy of Sciences, 1993
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983