Cryptic simplicity in DNA is a major source of genetic variation
- 1 August 1986
- journal article
- Published by Springer Nature in Nature
- Vol. 322 (6080) , 652-656
- https://doi.org/10.1038/322652a0
Abstract
DNA regions which are composed of a single or relatively few short sequence motifs usually in tandem ('pure simple sequences') have been reported in the genomes of diverse species, and have been implicated in a range of functions including gene regulation, signals for gene conversion and recombination, and the replication of telomeres. They are thought to accumulate by DNA slippage and mispairing during replication and recombination or extension of single-strand ends. In order to systematize the range of DNA simplicity and the genetic nature of the regions that are simple, we have undertaken an extensive computer search of the DNA sequence library of the European Molecular Biology Laboratory (EMBL). We show here that nearly all possible simple motifs occur 5-10 times more frequently than equivalent random motifs. Furthermore, a new computer algorithm reveals the widespread occurrence of significantly high levels of a new type of 'cryptic simplicity' in both coding and noncoding DNA. Cryptically simple regions are biased in nucleotide composition and consist of scrambled arrangements of repetitive motifs which differ within and between species. The universal existence of DNA simplicity from monotonous arrays of single motifs to variable permutations of relatively short-lived motifs suggests that ubiquitous slippage-like mechanisms are a major source of genetic variation in all regions of the genome, not predictable by the classical mutation process.Keywords
This publication has 38 references indexed in Scilit:
- Conservation and divergence in multigene families: alternatives to selection and driftPhilosophical Transactions of the Royal Society of London. B, Biological Sciences, 1986
- Rates of molecular evolution: The hominoid slowdownBioEssays, 1985
- Hypervariable ‘minisatellite’ regions in human DNANature, 1985
- THE MOLECULAR STRUCTURE OF CENTROMERES AND TELOMERESAnnual Review of Biochemistry, 1984
- Simple sequences are ubiquitous repetitive components of eukaryotic genomesNucleic Acids Research, 1984
- A novel repeated element with Z-DNA-forming potential is widely found in evolutionarily diverse eukaryotic genomes.Proceedings of the National Academy of Sciences, 1982
- Homocopolymer sequences in the spacer of a sea urchin histone gene repeat are sensitive to S1 nucleaseNature, 1982
- A history of the human fetal globin gene duplicationCell, 1981
- Molecular structure of a left-handed double helical DNA fragment at atomic resolutionNature, 1979
- Chromosomal Subunits in Active Genes Have an Altered ConformationScience, 1976