An analysis of protein folding type prediction by seed-propagated sampling and jackknife test
- 1 October 1995
- journal article
- research article
- Published by Springer Nature in Protein Journal
- Vol. 14 (7) , 583-593
- https://doi.org/10.1007/bf01886884
Abstract
In the development of methodology for statistical prediction of protein folding types, how to test the predicted results is a crucial problem. In addition to the resubstitution test in which the folding type of each protein from a training set is predicted based on the rules derived from the same set, cross-validation tests are needed. Among them, the single-testset method seems to be least reliable due to the arbitrariness in selecting the test set. Although the leaving-one-out (or jackknife) test is more objective and hence more reliable, it may cause a severe information loss by leaving a protein in turn out of the training set when its size is not large enough. In order to overcome the above drawback, a seed-propagated sampling approach is proposed that can be used to generate any number of simulated proteins with a desired type based on a given training set database. There is no need to make any predetermined assumption about the statistical distribution function of the amino acid frequencies. Combined with the existing cross-validation methods, the new technique may provide a more objective estimation for various protein-folding-type prediction methods.Keywords
This publication has 19 references indexed in Scilit:
- Protein folding classes: a geometric interpretation of the amino acid composition of globular proteinsProtein Engineering, Design and Selection, 1994
- Cross‐validation of protein structural class prediction using statistical clustering and neural networksProtein Science, 1993
- Monte Carlo simulation studies on the prediction of protein folding types from amino acid compositionBiophysical Journal, 1992
- A weighting method for predicting protein structural class from amino acid compositionEuropean Journal of Biochemistry, 1992
- A correlation‐coefficient method to predicting protein‐structural classes from amino acid compositionsEuropean Journal of Biochemistry, 1992
- Predicting protein secondary structure content: A tandem neural network approachJournal of Molecular Biology, 1992
- An optimization approach to predicting protein structural class from amino acid compositionProtein Science, 1992
- Prediction of protein structural class by discriminant analysisBiochimica et Biophysica Acta (BBA) - Protein Structure and Molecular Enzymology, 1986
- Prediction of protein structural class from the amino acid sequenceBiopolymers, 1986
- Structural patterns in globular proteinsNature, 1976