Nucleotide, dinucleotide and trinucleotide frequencies explain patterns observed in chaos game representations of DNA sequences
- 1 January 1993
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 21 (10) , 2487-2491
- https://doi.org/10.1093/nar/21.10.2487
Abstract
The chaos game representation (CGR) is a scatter plot derived from a DNA sequence, with each point of the plot corresponding to one base of the sequence. If the DNA sequence were a random collection of bases, the CGR would be a uniformly filled square; conversely, any patterns visible in the CGR represent some pattern (information) in the DNA sequence. In this paper, patterns previously observed in a variety of DNA sequences are explained solely in terms of nucleotide, dinucleotide and trinucleotide frequencies.Keywords
This publication has 10 references indexed in Scilit:
- Mathematical characterization of Chaos Game RepresentationJournal of Molecular Biology, 1992
- Chaos game representation of coding regions of human globin genes and alcohol dehydrogenase genes of phylogenetically divergent speciesJournal of Molecular Evolution, 1992
- Chaos game representation of gene structureNucleic Acids Research, 1990
- The analysis of intron data and their use in the detection of short signalsJournal of Molecular Evolution, 1987
- A statistical analysis of nucleotide sequences of introns and exons in human genes.Molecular Biology and Evolution, 1987
- Markov chain analysis finds a significant influence of neighboring bases on the occurrence of a base in eucaryotic nuclear DNA sequences both protein-coding and noncodingJournal of Molecular Evolution, 1985
- A Markov analysis of DNA sequencesJournal of Theoretical Biology, 1983
- Codon catalog usage and the genome hypothesisNucleic Acids Research, 1980
- DNA methylation and the frequency of CpG in animal DNANucleic Acids Research, 1980
- ENZYMATIC SYNTHESIS OF DEOXYRIBONUCLEIC ACID .8. FREQUENCIES OF NEAREST NEIGHBOR BASE SEQUENCES IN DEOXYRIBONUCLEIC ACID1961