Clustering of Identical Oligomers in Coding and Noncoding DNA Sequences
- 1 August 1999
- journal article
- research article
- Published by Taylor & Francis in Journal of Biomolecular Structure and Dynamics
- Vol. 17 (1) , 79-87
- https://doi.org/10.1080/07391102.1999.10508342
Abstract
We develop a quantitative method for analyzing repetitions of identical short oligomers in coding and noncoding DNA sequences. We analyze sequences currently available in GenBank separately for the primate, mammal, vertebrate, rodent, invertebrate and plant taxonomic partitions. We find that some oligomers “cluster” more than they would if randomly distributed, while other oligomers “repel” each other. To quantify this degree of clustering, we define clustering measures. We find that (i) clustering differs significantly between coding and noncoding DNA; (ii) in most cases, monomers, dimers and tetramers cluster in noncoding DNA but appear to repel each other in coding DNA; (iii) the degree of clustering is more conserved across different sources (primates, invertebrates, and plants) in coding DNA than in noncoding DNA; (iv) in contrast to other oligomers, trimers always prefer to cluster; and (v) the clustering of each particular oligomer is conserved within the same organism.
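The abstract does not state how the clustering measures are defined, so the following is only an illustrative sketch and not the authors' method. It assumes one simple stand-in measure: the number of times an oligomer is immediately followed by an identical copy, divided by the number expected if the same oligomers were randomly shuffled. The function name `clustering_ratio`, the fixed reading frame, and the shuffle-based null model are all assumptions made for this example.

```python
from collections import Counter


def clustering_ratio(seq, k):
    """Return {oligomer: observed / expected adjacent-repeat ratio}.

    Hypothetical measure for illustration only (not taken from the paper):
    a ratio above 1 suggests clustering, below 1 suggests repulsion.
    """
    seq = seq.upper()
    # Split into non-overlapping words of length k (frame 0); drop any ragged tail.
    words = [seq[i:i + k] for i in range(0, len(seq) - k + 1, k)]
    n = len(words)
    counts = Counter(words)
    # Observed: positions where a word is immediately followed by an identical word.
    observed = Counter(a for a, b in zip(words, words[1:]) if a == b)
    ratios = {}
    for word, c in counts.items():
        # Expected number of adjacent identical pairs under a random shuffle
        # of the same words: (n - 1) * (c / n) * ((c - 1) / (n - 1)).
        expected = c * (c - 1) / n
        if expected > 0:
            ratios[word] = observed[word] / expected
    return ratios


print(clustering_ratio("AAAATTTT", 1))  # {'A': 2.0, 'T': 2.0}
print(clustering_ratio("ATATATAT", 1))  # {'A': 0.0, 'T': 0.0}
```

In the first toy sequence each monomer sits next to its own copies twice as often as a shuffle would predict, while in the second identical monomers never touch. Real sequences would need separate handling of coding and noncoding regions and of reading frames, which this sketch does not attempt.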