Multi‐modality of pI distribution in whole proteome
- 17 January 2006
- journal article
- research article
- Published by Wiley in Proteomics
- Vol. 6 (2) , 449-455
- https://doi.org/10.1002/pmic.200500221
Abstract
Multi‐modality of the pI distribution is a common feature of whole proteomes. Some researchers have attributed it to proteins with different subcellular locations and interpreted it as a result of natural selection. We explored the pI distributions of predicted proteomes (from animals, plants, bacteria and archaea) and of a random proteome [random protein sequences constructed according to the specific amino acid composition and molecular weight (MW) distribution of the human predicted proteome]. Our results suggest that the multi‐modality is the result of the discrete pKR values of the different amino acids. The amino acid composition and MW distribution of a proteome also contribute to its specific pI distribution. Although protein subcellular location is related to pI value, our comparison with the random proteome revealed that neither the multi‐modality nor the distribution bias of pI values is caused by subcellular location. The multi‐modal distribution therefore appears to be merely a mathematical consequence. The blank region near neutral pI is caused by the absence of amino acids with a neutral pKR, which suggests that the selection of amino acids with ionizable side chains might have been restricted by the requirement for a particular pH environment during the origin of life. From this point of view, the distribution is the result of natural selection.
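The random-proteome argument can be illustrated with a minimal sketch. It is not the authors' procedure: the pK values below are assumed textbook-style numbers, the random sequences use a uniform amino acid composition and an arbitrary length range rather than the human composition and MW distribution, and the function names (`net_charge`, `isoelectric_point`, `random_proteome_pi_histogram`) are hypothetical. The pI is estimated by bisection on the Henderson-Hasselbalch net charge.

```python
import random
from collections import Counter

# Assumed pK values for illustration only; they are not taken from the paper.
PK_POSITIVE = {"K": 10.5, "R": 12.5, "H": 6.0, "Nterm": 9.0}
PK_NEGATIVE = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1, "Cterm": 2.0}
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def net_charge(seq, ph):
    """Net charge of a sequence at a given pH (Henderson-Hasselbalch)."""
    counts = {aa: seq.count(aa) for aa in "KRHDECY"}
    counts["Nterm"] = 1
    counts["Cterm"] = 1
    positive = sum(counts[g] / (1 + 10 ** (ph - pk)) for g, pk in PK_POSITIVE.items())
    negative = sum(counts[g] / (1 + 10 ** (pk - ph)) for g, pk in PK_NEGATIVE.items())
    return positive - negative


def isoelectric_point(seq, lo=0.0, hi=14.0, tol=1e-4):
    """Find the pH where the net charge is zero.

    The net charge decreases monotonically with pH, so bisection converges.
    """
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if net_charge(seq, mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def random_proteome_pi_histogram(n_proteins=2000, seed=0, bin_width=0.5):
    """Histogram of pI values for random sequences.

    Assumption: uniform composition and lengths drawn from 50..1000 residues,
    which only approximates the idea of a composition-matched random proteome.
    """
    rng = random.Random(seed)
    histogram = Counter()
    for _ in range(n_proteins):
        length = rng.randint(50, 1000)
        seq = "".join(rng.choice(AMINO_ACIDS) for _ in range(length))
        pi = isoelectric_point(seq)
        histogram[int(pi / bin_width) * bin_width] += 1
    return histogram


if __name__ == "__main__":
    hist = random_proteome_pi_histogram()
    for bin_start in sorted(hist):
        print(f"{bin_start:4.1f}  {'#' * (hist[bin_start] // 10)}")
```

Because only a handful of discrete pK values enter the charge equation, the net-charge curve is steep near those values and flat between them, so pI values of even random sequences tend to cluster rather than spread evenly. Swapping in a measured composition or a different pK table changes where the clusters fall, which is why both are called out as assumptions here.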