Method to determine the reading frame of a protein from the purine/pyrimidine genome sequence and its possible evolutionary justification.
- 1 March 1981
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 78 (3), 1596-1600
- https://doi.org/10.1073/pnas.78.3.1596
Abstract
The periodic variations obtained by correlating the relative positions of purines and pyrimidines (and of the 4 bases thymine, cytosine, adenine and guanine) in a wide variety of genomes of wholly or partly known sequence suggest that there may be enough of an earlier comma-free coding system (i.e., only readable in 1 frame) still present to permit determination of the reading frame and approximate extent of the present protein coding stretches. The characteristics of these variations support the hypothesis that these primitive messages were formed of coding triplets having the form RNY (R = purine; Y = pyrimidine; and N = purine or pyrimidine). The base sequences and reading frames that have a minimal deviation from such a message are still good predictors of actual coding regions and reading frames in spite of the many mutations that have occurred since such a genetic code was last in use. In fact, the right frame for almost all the proteins in a number of viruses and various prokaryotes and eukaryotes is deduced purely from purine/pyrimidine information and not by using the normal start and stop signals.

This publication has 32 references indexed in Scilit:
- Organization of the recA gene of Escherichia coli. Proceedings of the National Academy of Sciences, 1980
- The complete sequence of a chromosomal mouse α-globin gene reveals elements conserved throughout vertebrate evolution. Cell, 1979
- Cloning and nucleotide sequence of DNA coding for bovine preproparathyroid hormone. Proceedings of the National Academy of Sciences, 1979
- Nucleotide sequence of cloned cDNA for bovine corticotropin-β-lipotropin precursor. Nature, 1979
- Sequence of three introns in the chick ovalbumin gene. Nature, 1979
- The Genome of Simian Virus 40. Science, 1978
- Nucleotide sequence of bacteriophage fd DNA. Nucleic Acids Research, 1978
- Complete nucleotide sequence of the 5′ noncoding region of human α- and β-globin mRNA. Cell, 1977
- Overlapping genes in bacteriophage φX174. Nature, 1976
- Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene. Nature, 1976
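
Illustrative sketch (not part of the indexed record): the abstract's idea of picking the reading frame with minimal deviation from an RNY message can be shown with a short Python example. This is a simplified assumption, not the paper's actual procedure: it simply counts, for each of the 3 frames, how many triplets have a purine in the first position and a pyrimidine in the third, and reports the frame with the most matches. The function names rny_matches and best_frame are hypothetical, and the abstract does not specify the scoring or correlation statistics the author used.

```python
# Simplified illustration only: the scoring rule below is an assumption,
# not the statistic used in the 1981 paper.
PURINES = set("AG")        # R
PYRIMIDINES = set("CTU")   # Y (U included so RNA sequences also work)


def rny_matches(seq, frame):
    """Count triplets in the given frame (0, 1 or 2) that fit the RNY pattern."""
    seq = seq.upper()
    count = 0
    # Step through complete triplets starting at the frame offset.
    for i in range(frame, len(seq) - 2, 3):
        if seq[i] in PURINES and seq[i + 2] in PYRIMIDINES:
            count += 1
    return count


def best_frame(seq):
    """Return (frame, scores), where frame has the most RNY-like triplets."""
    scores = [rny_matches(seq, f) for f in range(3)]
    return scores.index(max(scores)), scores


if __name__ == "__main__":
    # Toy sequence: one leading base followed by five GCT (RNY) triplets.
    frame, scores = best_frame("A" + "GCT" * 5)
    print(frame, scores)
```

On real genomic sequence the three scores would differ far less sharply than on this toy input, so a practical version would likely score sliding windows and compare frames statistically rather than take a raw maximum.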