Compositional segmentation and long-range fractal correlations in DNA sequences
- 1 May 1996
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review E
- Vol. 53 (5) , 5181-5189
- https://doi.org/10.1103/physreve.53.5181
Abstract
A segmentation algorithm based on the Jensen-Shannon entropic divergence is used to decompose long-range correlated DNA sequences into statistically significant, compositionally homogeneous patches. By adequately setting the significance level for segmenting the sequence, the underlying power-law distribution of patch lengths can be revealed. Some of the identified DNA domains were uncorrelated, but most of them continued to display long-range correlations even after several steps of recursive segmentation, thus indicating a complex multi-length-scaled structure for the sequence. On the other hand, by separately shuffling each segment, or by randomly rearranging the order in which the different segments occur in the sequence, shuffled sequences preserving the original statistical distribution of patch lengths were generated. Both types of random sequences displayed the same correlation scaling exponents as the original DNA sequence, thus demonstrating that neither the internal structure of patches nor the order in which these are arranged in the sequence is critical; therefore, long-range correlations in nucleotide sequences seem to rely only on the power-law distribution of patch lengths. © 1996 The American Physical Society.
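The core step described in the abstract, choosing a cut point by Jensen-Shannon divergence and then shuffling each patch separately, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The length-weighted form of the divergence, the function names (`shannon_entropy`, `best_cut`, `shuffle_within_segments`) and the use of base-2 logarithms are all assumptions made for illustration. The significance test and the recursion over sub-segments mentioned in the abstract are omitted.

```python
import math
import random
from collections import Counter


def shannon_entropy(counts, total):
    """Shannon entropy in bits of a composition given raw symbol counts."""
    h = 0.0
    for c in counts.values():
        if c > 0:
            p = c / total
            h -= p * math.log2(p)
    return h


def best_cut(seq):
    """Return (position, divergence) of the single cut that maximizes the
    length-weighted Jensen-Shannon divergence between seq[:pos] and seq[pos:].

    Assumption: each side is weighted by its fraction of the total length.
    """
    n = len(seq)
    if n < 2:
        raise ValueError("need at least two symbols to place a cut")
    total = Counter(seq)
    h_all = shannon_entropy(total, n)
    left = Counter()
    right = Counter(total)
    best_pos, best_d = 0, -1.0
    for i in range(1, n):
        # Move one symbol from the right part to the left part.
        base = seq[i - 1]
        left[base] += 1
        right[base] -= 1
        d = (h_all
             - (i / n) * shannon_entropy(left, i)
             - ((n - i) / n) * shannon_entropy(right, n - i))
        if d > best_d:
            best_pos, best_d = i, d
    return best_pos, best_d


def shuffle_within_segments(seq, cuts, rng):
    """Shuffle each segment separately, so patch lengths and per-patch
    composition are preserved while internal order is destroyed."""
    bounds = [0, *cuts, len(seq)]
    pieces = []
    for a, b in zip(bounds, bounds[1:]):
        seg = list(seq[a:b])
        rng.shuffle(seg)
        pieces.append("".join(seg))
    return "".join(pieces)


if __name__ == "__main__":
    toy = "A" * 50 + "G" * 50          # two perfectly homogeneous patches
    pos, d = best_cut(toy)
    print(pos, d)
    print(shuffle_within_segments("ACGTTTGCA", [4], random.Random(0)))
```

In a fuller version, a cut would be accepted only if its divergence passes a chosen significance level, and the procedure would then be applied recursively to each resulting segment.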