Prediction of protein cellular attributes using pseudo‐amino acid composition
Top Cited Papers
- 7 March 2001
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 43 (3) , 246-255
- https://doi.org/10.1002/prot.1035
Abstract
The cellular attributes of a protein, such as which compartment of a cell it belongs to and how it is associated with the lipid bilayer of an organelle, are closely correlated with its biological functions. The success of human genome project and the rapid increase in the number of protein sequences entering into data bank have stimulated a challenging frontier: How to develop a fast and accurate method to predict the cellular attributes of a protein based on its amino acid sequence? The existing algorithms for predicting these attributes were all based on the amino acid composition in which no sequence order effect was taken into account. To improve the prediction quality, it is necessary to incorporate such an effect. However, the number of possible patterns for protein sequences is extremely large, which has posed a formidable difficulty for realizing this goal. To deal with such a difficulty, the pseudo‐amino acid composition is introduced. It is a combination of a set of discrete sequence correlation factors and the 20 components of the conventional amino acid composition. A remarkable improvement in prediction quality has been observed by using the pseudo‐amino acid composition. The success rates of prediction thus obtained are so far the highest for the same classification schemes and same data sets. It has not escaped from our notice that the concept of pseudo‐amino acid composition as well as its mathematical framework and biochemical implication may also have a notable impact on improving the prediction quality of other protein features. Proteins 2001;43:246–255.Keywords
This publication has 23 references indexed in Scilit:
- Using Discriminant Function for Prediction of Subcellular Location of Prokaryotic ProteinsBiochemical and Biophysical Research Communications, 1998
- Relation between amino acid composition and cellular location of proteinsJournal of Molecular Biology, 1997
- Protein Lipidation in Cell SignalingScience, 1995
- A novel approach to predicting protein structural classes in a (20–1)‐D amino acid composition spaceProteins-Structure Function and Bioinformatics, 1995
- Transmembrane helices predicted at 95% accuracyProtein Science, 1995
- Prediction of Protein Structural ClassesCritical Reviews in Biochemistry and Molecular Biology, 1995
- Discrimination of Intracellular and Extracellular Proteins Using Amino Acid Composition and Residue-pair FrequenciesJournal of Molecular Biology, 1994
- Myristylation and palmitylation of Src family members: The fats of the matterCell, 1994
- Prediction of protein antigenic determinants from amino acid sequences.Proceedings of the National Academy of Sciences, 1981
- Contribution of Hydrophobic Interactions to the Stability of the Globular Conformation of ProteinsJournal of the American Chemical Society, 1962