Correlations between the compositional properties of human genes, codon usage, and amino acid composition of proteins
- 1 June 1991
- journal article
- research article
- Published by Springer Nature in Journal of Molecular Evolution
- Vol. 32 (6), 504-510
- https://doi.org/10.1007/bf02102652
Abstract
We have analyzed the correlation that exists between the GC levels of third and first or second codon positions for about 1400 human coding sequences. The linear relationship that was found indicates that the large differences in GC level of third codon positions of human genes are paralleled by smaller differences in GC levels of first and second codon positions. Whereas third codon position differences correspond to very large differences in codon usage within the human genome, the first and second codon position differences correspond to smaller, yet very remarkable, differences in the amino acid composition of encoded proteins. Because GC levels of codon positions are linearly correlated with the GC levels of the isochores harboring the corresponding genes, both codon usage and amino acid composition are different for proteins encoded by genes located in isochores of different GC levels. Furthermore, we have also shown that a linear relationship with a unity slope and a correlation coefficient of 0.77 exists between GC levels of introns and exons from the 238 human genes currently available for this analysis. Introns are, however, about 5% lower in GC, on average, than exons from the same genes.
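The quantities the abstract compares can be illustrated with a minimal sketch. It is not the authors' procedure, which the abstract does not describe. It assumes each coding sequence is an in-frame DNA string, computes the GC level at each codon position, and fits an ordinary least-squares line between two sets of GC levels. The function names `gc_by_codon_position` and `fit_line` and the toy sequences are hypothetical and chosen only for illustration.

```python
from math import sqrt


def gc_by_codon_position(cds: str) -> tuple:
    """Return (GC1, GC2, GC3) in percent for an in-frame coding sequence.

    Assumption: the string starts at the first base of a codon.
    A trailing partial codon, if any, is ignored.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3
    if usable == 0:
        raise ValueError("sequence is shorter than one codon")
    levels = []
    for pos in range(3):
        bases = seq[pos:usable:3]  # every third base, starting at this codon position
        gc_count = sum(base in "GC" for base in bases)
        levels.append(100.0 * gc_count / len(bases))
    return tuple(levels)


def fit_line(xs, ys):
    """Ordinary least-squares fit of ys on xs.

    Returns (slope, intercept, Pearson correlation coefficient).
    """
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need two equal-length samples with at least two points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        raise ValueError("both samples must vary for a line to be fitted")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / sqrt(sxx * syy)
    return slope, intercept, r


if __name__ == "__main__":
    # Toy sequences invented for illustration; they are not data from the paper.
    toy_genes = ["ATGGCCGCGCTGTAA", "ATGAAATTTACATAA", "ATGGGCAAGCTCTGA"]
    levels = [gc_by_codon_position(gene) for gene in toy_genes]
    gc3 = [gc[2] for gc in levels]
    gc1 = [gc[0] for gc in levels]
    slope, intercept, r = fit_line(gc3, gc1)
    print(f"GC1 = {slope:.2f} * GC3 + {intercept:.2f}  (r = {r:.2f})")
```

The same `fit_line` helper could be applied to paired intron and exon GC levels, where the abstract reports a slope near unity and a correlation coefficient of 0.77; the intercept would then reflect the offset between introns and exons.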
This publication has 13 references indexed in Scilit:
- Compositional properties of nuclear genes from cold-blooded vertebrates. Journal of Molecular Evolution, 1991
- The compositional properties of human genes. Journal of Molecular Evolution, 1991
- The isochore organization of the human genome. Annual Review of Genetics, 1989
- Compositional constraints and genome evolution. Journal of Molecular Evolution, 1986
- Codon usage and genome composition. Journal of Molecular Evolution, 1985
- ACNUC – a portable retrieval system for nucleic acid sequence databases: logical and physical designs and usage. Bioinformatics, 1985
- The mosaic genome of warm-blooded vertebrates. Science, 1985
- Working of the genetic code. Trends in Biochemical Sciences, 1980
- Codon catalog usage and the genome hypothesis. Nucleic Acids Research, 1980
- Correlation between base composition of deoxyribonucleic acid and amino acid composition of protein. Proceedings of the National Academy of Sciences, 1961