Repeats of base oligomers as the primordial coding sequences of the primeval earth and their vestiges in modern genes
- 1 August 1984
- journal article
- research article
- Published by Springer Nature in Journal of Molecular Evolution
- Vol. 20 (3-4) , 313-321
- https://doi.org/10.1007/bf02104737
Abstract
Three outstanding properties uniquely qualify repeats of base oligomers as the primordial coding sequences of all polypeptide chains. First, when compared with randomly generated base sequences in general, they are more likely to have long open reading frames. Second, periodical polypeptide chains specified by such repeats are more likely to assume either α-helical or β-sheet secondary structures than are polypeptide chains of random sequence. Third, provided that the number of bases in the oligomeric unit is not a multiple of 3, these internally repetitious coding sequences are impervious to randomly sustained base substitutions, deletions, and insertions. This is because the recurring periodicity of their polypeptide chains is given by three consecutive copies of the oligomeric unit translated in three different reading frames. Accordingly, when one reading frame is open, the other two are automatically open as well, all three being capable of coding for polypeptide chains of identical periodicity. Under this circumstance, a frame shift due to the deletion or insertion of a number of bases that is not a multiple of 3 fails to alter the downstream amino acid sequence, and even a base change causing premature chain-termination can silence only one of the three potential coding units. Newly arisen coding sequences in modern organisms are oligomeric repeats, and most of the older genes retain various vestiges of their original internal repetitions. Some of the genes (e.g., oncogenes) have even inherited the property of being impervious to randomly sustained base changes.Keywords
This publication has 28 references indexed in Scilit:
- Evolution of the genetic apparatusPublished by Elsevier ,2004
- Evolution of the albumin: α-fetoprotein ancestral gene from the amplification of a 27 nucleotide sequenceJournal of Molecular Biology, 1984
- Close similarity of epidermal growth factor receptor and v-erb-B oncogene protein sequencesNature, 1984
- Simple Construction of Human c-myc Gene Implicated in B-Cell Neoplasmas and Its Relationship with Avian v-myc and Human LymphokinesScandinavian Journal of Immunology, 1983
- Structure of the plasmodium knowlesi gene coding for the circumsporozoite proteinCell, 1983
- Nucleotide sequence of cloned cDNA of human c-myc oncogeneNature, 1983
- Frameshift and intragenic suppressor mutations in a rous sarcoma provirus suggest src encodes two proteinsCell, 1983
- Genetic code: Mitochondrial codes and evolutionNature, 1983
- Events in the Evolution of Pre-ProinsulinScience, 1982
- Catalysis of accurate poly(C)-directed synthesis of 3′-5′-linked oligoguanylates by Zn2+Journal of Molecular Biology, 1980