Predicting Enzyme Subclass by Functional Domain Composition and Pseudo Amino Acid Composition
- 8 April 2005
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 4 (3) , 967-971
- https://doi.org/10.1021/pr0500399
Abstract
As a continuous effort to use the sequence approach to identify enzymatic function at a deeper level, investigations are extended from the main enzyme classes (Protein Sci.2004, 13, 2857−2863) to their subclasses. This is indispensable if we wish to understand the molecular mechanism of an enzyme at a deeper level. For each of the 6 main enzyme classes (i.e., oxidoreductase, transferase, hydrolase, lyase, isomerase, and ligase), a subclass training dataset is constructed. To reduce homologous bias, a stringent cutoff was imposed that all the entries included in the datasets have less than 40% sequence identity to each other. To catch the core feature that is intimately related to the biological function, the sample of a protein is represented by hybridizing the functional domain composition and pseudo amino acid composition. On the basis of such a hybridization representation, the FunD-PseAA predictor is established. It is demonstrated by the jackknife cross-validation tests that the overall success rate in identifying the 21 subclasses of oxidoreductases is above 86%, and the corresponding rates in identifying the subclasses of the other 5 main enzyme classes are 94−97%. The high success rates imply that the FunD-PseAA predictor may become a useful tool in bioinformatics and proteomics of the post-genomic era. Keywords: ENZYME database • 40% cutoff • functional domain • pseudo amino acid composition • ISort predictor • FunD-PseAA predictor • bioinformatics • proteomicsKeywords
This publication has 12 references indexed in Scilit:
- Validation of qualitative models of genetic regulatory networks by model checking: analysis of the nutritional stress response in Escherichia coliBioinformatics, 2005
- Corrigendum to “Predicting protein structural class by functional domain composition” [Biochem. Biophys. Res. Commun. 321 (2004) 1007–1009]Biochemical and Biophysical Research Communications, 2005
- Enzyme family classification by support vector machinesProteins-Structure Function and Bioinformatics, 2004
- Subcellular location prediction of apoptosis proteinsProteins-Structure Function and Bioinformatics, 2002
- Enzyme Function Less Conserved than AnticipatedJournal of Molecular Biology, 2002
- Some insights into protein structural class predictionProteins-Structure Function and Bioinformatics, 2001
- Efficient gene activation in cultured mammalian cells mediated by FLP recombinase-expressing recombinant adenovirusNucleic Acids Research, 2001
- Prediction of protein cellular attributes using pseudo‐amino acid compositionProteins-Structure Function and Bioinformatics, 2001
- A novel approach to predicting protein structural classes in a (20–1)‐D amino acid composition spaceProteins-Structure Function and Bioinformatics, 1995
- Prediction of Protein Structural Classes from Amino Acid CompositionsPublished by Springer Nature ,1989