Statistical characterization of nucleic acid sequence functional domains
- 1 January 1983
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 11 (7) , 2205-2220
- https://doi.org/10.1093/nar/11.7.2205
Abstract
It has long been recognized that various genome classes were distinguishable on the basis of base composition and nearest neighbor frequencies. In addition Grantham et al. (8) have recently presented evidence that these distinctions are preserved at the level of codon usage. As discussed in this report it is now clear that these and related statistics can uniquely characterize the various functional domains of the genome. In particular peptide coding, intervening segments, structural RNA coding and mitochondrial domains of the vertebrate genome are uniquely characterizable. The statistical measures not only reflect understood functional differences among these domains but suggest others. The ability of these simple statistics of nucleic acid sequences to reflect so much of the encoded complex pattern information and/or effects of selective constraints is somewhat surprising. Here, we investigated the statistical measures most distinctive of the various domains and then linked them to our current understandings in so far as possible.Keywords
This publication has 34 references indexed in Scilit:
- A + T-rich linkers define functional domains in eukaryotic DNANature, 1982
- Structure of a B-DNA dodecamerJournal of Molecular Biology, 1981
- Sequence-dependent variation in the conformation of DNAJournal of Molecular Biology, 1981
- DNA methylation and control of gene expressionNature, 1981
- Comparative biosequence metricsJournal of Molecular Evolution, 1981
- Organization of chimeras between filamentous bacteriophage f1 and plasmid pSC101Journal of Molecular Biology, 1980
- Estimating the total number of nucleotide substitutions since the common ancestor of a pair of homologous genes: Comparison of several methods and three beta hemoglobin messenger RNA'sJournal of Molecular Evolution, 1980
- Sequence of the lactose permease geneNature, 1980
- DNA methylation and the frequency of CpG in animal DNANucleic Acids Research, 1980
- Complete nucleotide sequence of an influenza virus haemagglutinin gene from cloned DNANature, 1979