Universal rule for coding sequence construction: TA/CG deficiency-TG/CT excess.
- 1 December 1988
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 85 (24) , 9630-9634
- https://doi.org/10.1073/pnas.85.24.9630
Abstract
Each coding sequence is a finite resource as to the number and composition of four bases. Accordingly, the excessive recurrence of one base oligomer entails the noticeable underrepresentation by the other, so that if the former is the same in most, if not all, of the coding sequences, the latter too must necessarily be the same in all. Indeed, a previous series of studies on 20-odd divergent coding sequences established CTG as one of the most frequently recurring base trimers (if not the most frequent), and this excess was compensated by the underrepresentation by CG and TA dimer-containing base trimers. In this study, I have analyzed three additional coding sequences and reanalyzed one previously studied coding sequence. These four, derived from man, a plant, and a fish, were of variously lopsided base compositions that were not at all conducive to high recurrences of either CT dimer or CT and TG. Yet, the excess of CT and TG dimers accompanied by complementary deficiency of CG and TA dimers emerged as the common rule. Thus, I propose the above as the universal rule of coding sequence construction. The underrepresentation by CG and TA dimers within coding sequences explains why regulatory signals in intergenic spacers are of two kinds: one, TA dimer rich; and the other, CG dimer rich.This publication has 12 references indexed in Scilit:
- Codon preference is but an illusion created by the construction principle of coding sequences.Proceedings of the National Academy of Sciences, 1988
- Early genes that were oligomeric repeats generated a number of divergent domains on their own.Proceedings of the National Academy of Sciences, 1987
- Evolution from primordial oligomeric repeats to modern coding sequencesJournal of Molecular Evolution, 1987
- Sequence and Expression of Human Estrogen Receptor Complementary DNAScience, 1986
- Primary structure of bovine thyroglobulin deduced from the sequence of its 8,431-base complementary DNANature, 1985
- An H1 histone gene from rainbow trout (Salmo gairdnerii)Journal of Molecular Evolution, 1985
- Isolation and DNA sequence of a full-length cDNA clone for human X chromosome-encoded phosphoglycerate kinase.Proceedings of the National Academy of Sciences, 1983
- Genetic code: Mitochondrial codes and evolutionNature, 1983
- Inactive X chromosome DNA does not function in DNA-mediated cell transformation for the hypoxanthine phosphoribosyltransferase gene.Proceedings of the National Academy of Sciences, 1980
- X inactivation, differentiation, and DNA methylationCytogenetic and Genome Research, 1975