Using pseudo amino acid composition to predict protein structural classes: Approached with complexity measure factor
- 20 January 2006
- journal article
- research article
- Published by Wiley in Journal of Computational Chemistry
- Vol. 27 (4) , 478-482
- https://doi.org/10.1002/jcc.20354
Abstract
The structural class is an important feature widely used to characterize the overall folding type of a protein. How to improve the prediction quality for protein structural classification by effectively incorporating the sequence-order effects is an important and challenging problem. Based on the concept of the pseudo amino acid composition [Chou, K. C. Proteins Struct Funct Genet 2001, 43, 246; Erratum: Proteins Struct Funct Genet 2001, 44, 60], a novel approach for measuring the complexity of a protein sequence was introduced. The advantage by incorporating the complexity measure factor into the pseudo amino acid composition as one of its components is that it can catch the essence of the overall sequence pattern of a protein and hence more effectively reflect its sequence-order effects. It was demonstrated thru the jackknife crossvalidation test that the overall success rate by the new approach was significantly higher than those by the others. It has not escaped our notice that the introduction of the complexity measure factor can also be used to improve the prediction quality for, among many other protein attributes, subcellular localization, enzyme family class, membrane protein type, and G-protein couple receptor type. © 2006 Wiley Periodicals, Inc. J Comput Chem 27: 478–482, 2006Keywords
This publication has 58 references indexed in Scilit:
- Using optimized evidence-theoretic K-nearest neighbor classifier and pseudo-amino acid composition to predict membrane protein typesBiochemical and Biophysical Research Communications, 2005
- Using supervised fuzzy clustering to predict protein structural classesBiochemical and Biophysical Research Communications, 2005
- Boosting classifier for predicting protein domain structural classBiochemical and Biophysical Research Communications, 2005
- SLLE for predicting membrane protein typesJournal of Theoretical Biology, 2005
- Using GO-PseAA predictor to predict enzyme sub-classBiochemical and Biophysical Research Communications, 2004
- Identify catalytic triads of serine hydrolases by support vector machinesJournal of Theoretical Biology, 2004
- Insights from modelling the 3D structure of the extracellular domain of α7 nicotinic acetylcholine receptorBiochemical and Biophysical Research Communications, 2004
- Prediction of protein cellular attributes using pseudo‐amino acid compositionProteins-Structure Function and Bioinformatics, 2001
- Prediction of Protein Subcellular Locations by Incorporating Quasi-Sequence-Order EffectBiochemical and Biophysical Research Communications, 2000
- A Joint Prediction of the Folding Types of 1490 Human Proteins from their Genetic CodonsJournal of Theoretical Biology, 1993