Distribution of Base Pair Repeats in Coding and Noncoding DNA Sequences
- 22 December 1997
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review Letters
- Vol. 79 (25) , 5182-5185
- https://doi.org/10.1103/physrevlett.79.5182
Abstract
We analyze the histograms for the lengths of the 16 possible distinct repeats of identical dimers, known as dimeric tandem repeats, in DNA sequences. For coding regions, the probability of finding a repetitive sequence of copies of a particular dimer decreases exponentially as increases. For the noncoding regions, the distribution functions for most of the 16 dimers have long tails and can be approximated by power-law functions, while for coding DNA, they can be well fit by a first-order Markov process. We propose a model, based on known biophysical processes, which leads to the observed probability distribution functions for noncoding DNA. We argue that this difference in the shape of the distribution functions between coding and noncoding DNA arises from the fact that noncoding DNA is more tolerant to evolutionary mutational alterations than coding DNA.
Keywords
This publication has 15 references indexed in Scilit:
- Quantification of DNA Patchiness Using Long-Range Correlation MeasuresBiophysical Journal, 1997
- Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysisPhysical Review E, 1995
- Characterizing Long-Range Correlations in DNA Sequences from Wavelet AnalysisPhysical Review Letters, 1995
- High resolution of human evolutionary trees with polymorphic microsatellitesNature, 1994
- Simple repeat DNA is not replicated simplyNature Genetics, 1994
- Characteristics of the Large (dA).(dT) Homopolymer Tracts in D. discoideum Gene Flanking and Intron SequencesJournal of Biomolecular Structure and Dynamics, 1993
- Ubiquitous somatic mutations in simple repeated sequences reveal a new mechanism for colonic carcinogenesisNature, 1993
- Long-range correlations in nucleotide sequencesNature, 1992
- Long-Range Correlation and Partial 1/ f α Spectrum in a Noncoding DNA SequenceEurophysics Letters, 1992
- Implications of thermodynamics of protein folding for evolution of primary sequencesNature, 1990