Predicting enzyme family class in a hybridization space
- 1 November 2004
- journal article
- Published by Wiley in Protein Science
- Vol. 13 (11) , 2857-2863
- https://doi.org/10.1110/ps.04981104
Abstract
Given the sequence of a protein, how can we predict whether it is an enzyme or a non-enzyme? If it is, what enzyme family class it belongs to? Because these questions are closely relevant to the biological function of a protein and its acting object, their importance is self-evident. Particularly with the explosion of protein sequences entering into data banks and the relatively much slower progress in using biochemical experiments to determine their functions, it is highly desired to develop an automated method that can be used to give fast answers to these questions. By hybridizing the gene ontology and pseudo-amino-acid composition, we have introduced a new method that is called GO-PseAA predictor and operate it in a hybridization space. To avoid redundancy and bias, demonstrations were performed on a data set in which none of the proteins in an individual class has > or =40% sequence identity to any other. The overall success rate thus obtained by the jackknife cross-validation test in identifying enzyme and non-enzyme was 93%, and that in identifying the enzyme family was 94% for the following six main Enzyme Commission (EC) classes: (1) oxidoreductase, (2) transferase, (3) hydrolase, (4) lyase, (5) isomerase, and (6) ligase. The corresponding rates by the independent data set test were 98% and 97%, respectively.Keywords
This publication has 23 references indexed in Scilit:
- Prediction and classification of protein subcellular location—sequence‐order effect and pseudo amino acid compositionJournal of Cellular Biochemistry, 2003
- Predicting protein quaternary structure by pseudo amino acid compositionProteins-Structure Function and Bioinformatics, 2003
- Prediction of Enzyme Family ClassesJournal of Proteome Research, 2003
- Subcellular location prediction of apoptosis proteinsProteins-Structure Function and Bioinformatics, 2002
- Prediction of protein cellular attributes using pseudo‐amino acid compositionProteins-Structure Function and Bioinformatics, 2001
- A novel approach to predicting protein structural classes in a (20–1)‐D amino acid composition spaceProteins-Structure Function and Bioinformatics, 1995
- Prediction of Protein Structural ClassesCritical Reviews in Biochemistry and Molecular Biology, 1995
- A Joint Prediction of the Folding Types of 1490 Human Proteins from their Genetic CodonsJournal of Theoretical Biology, 1993
- Prediction of protein antigenic determinants from amino acid sequences.Proceedings of the National Academy of Sciences, 1981
- Contribution of Hydrophobic Interactions to the Stability of the Globular Conformation of ProteinsJournal of the American Chemical Society, 1962