Distribution and Evolution of Sequence Characteristics in theE. coliGenome
- 1 October 1986
- journal article
- research article
- Published by Taylor & Francis in Journal of Biomolecular Structure and Dynamics
- Vol. 4 (2) , 291-307
- https://doi.org/10.1080/07391102.1986.10506347
Abstract
The mean (G+C) composition (51.0%) and standard deviation (±3.8%) of published DNA sequences accounting for 10% of the E. coli genome is in excellent agreement with the principal overall distribution determined by high resolution melting. While differences in base and neighbor characteristics are small and uniform throughout all regions of the genome, it is found that the (G+C) content of sequences varies in segmented fashion within boundaries corresponding to coding (53% G+C) and noncoding (46% G+C) regions; with variances in the latter being six-fold greater than in coding regions. The variance in different regions shows a strong negative dependence on (G+C) content of the region, reflecting the condition that A-T and G-C base pairs are preferred neighbors of A-T and C-G pairs, respectively; with the bias increasing with decreasing (G+C) content. Neighbor analysis indicates the most extreme positive biases occur in AA, TT, GC and CG throughout all regions, but particularly in noncoding regions. Extraordinary numbers of oligomeric strings of (A)n, etc., are the further consequence of this bias. These and other characteristics point to the existence of inherent biases in neighbor frequencies levied during replication or repair, and which reflect, in turn, neighbor influences during mutation. The bias in codon usage noted by Grantham and others is seen here as due, in part, to the adaptation of coding sequences to this microenvironment through selection among synonymous codons so as to preserve inherent neighbor biases.This publication has 29 references indexed in Scilit:
- Thermal denaturation of DNA molecules: A comparison of theory with experimentPublished by Elsevier ,2002
- Correlation between thermal stability maps and genetic maps of double-stranded DNAsJournal of Theoretical Biology, 1983
- Correlation of Tm and sequence of DNA duplexes with ΔH computed by an improved empirical potential methodBiopolymers, 1983
- Stabilities of nearest‐neighbor doublets in double‐helical DNA determined by fitting calculated melting profiles to observed profilesBiopolymers, 1981
- Analysis of high-resolution melting (thermal dispersion) of DNA. MethodsBiopolymers, 1980
- Studies on the biochemical basis of spontaneous mutationJournal of Molecular Biology, 1977
- Theoretical models for heterogeneity of base composition in DNAJournal of Theoretical Biology, 1974
- A denaturation map of the λ phage DNA molecule determined by electron microscopyJournal of Molecular Biology, 1966
- New Approaches to Bacterial TaxonomyAnnual Review of Microbiology, 1963
- The dispersion of the hyperchromic effect in thermally induced transitions of nucleic acidsJournal of Molecular Biology, 1962