Hydrophobicity, expressivity and aromaticity are the major trends of amino-acid usage in 999 Escherichia coli chromosome-encoded genes
- 1 January 1994
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 22 (15) , 3174-3180
- https://doi.org/10.1093/nar/22.15.3174
Abstract
Multivariate analysis of the amino-acid compositions of 999 chromosome-encoded proteins from Escherichia coli showed that three main factors influence the variability of amino-acid composition. The first factor was correlated with the global hydrophobicity of proteins, and it discriminated integral membrane proteins from the others. The second factor was correlated with gene expressivity, showing a bias in highly expressed genes towards amino-acids having abundant major tRNAs. Just as highly expressed genes have reduced codon diversity in protein coding sequences, so do they have a reduced diversity of amino-acid choice. This showed that translational constraints are important enough to affect the global amino-acid composition of proteins. The third factor was correlated with the aromaticity of proteins, showing that aromatic amino-acid content is highly variable.Keywords
This publication has 23 references indexed in Scilit:
- GenBankNucleic Acids Research, 1993
- Classification of Proteins into Groups Based on Amino Acid Composition and Other Characters. I. Angular DistributionThe Journal of Biochemistry, 1983
- Codon usage in bacteria: correlation with gene expressivityNucleic Acids Research, 1982
- A simple method for displaying the hydropathic character of a proteinJournal of Molecular Biology, 1982
- Correlation of the Amino Acid Composition of a Protein to Its Structural and Biological Characters1The Journal of Biochemistry, 1982
- Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genesJournal of Molecular Biology, 1981
- Codon frequencies in 119 individual genes confirm corsistent choices of degenerate bases according to genome typeNucleic Acids Research, 1980
- Genetic distances from mRNA sequencesThe Science of Nature, 1980
- Codon catalog usage and the genome hypothesisNucleic Acids Research, 1980
- Differential utilization of leucyl-tRNAs by Escherichia coli.Proceedings of the National Academy of Sciences, 1977