Lack of Biological Significance in the 'Linguistic Features' of Noncoding DNA--A Quantitative Analysis
Open Access
- 1 May 1996
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 24 (9) , 1676-1681
- https://doi.org/10.1093/nar/24.9.1676
Abstract
Recently, the application of two statistical methods (related to Zipf's distribution and Shannon's redundancy), called ‘linguistic’ tests, to the primary structure of DNA sequences of living organisms has excited considerable interest. Of particular importance is the claim that noncoding DNA sequences in eukaryotes display specific ‘linguistic’ features, being reminiscent of natural languages. Furthermore, this implies that noncoding regions of DNA may carry some new, thus far unknown, biological information which is revealed by these tests. In this paper these claims are tested quantitatively. With the aid of computer simulations of natural DNA sequences, and by applying the same ‘linguistic’ tests to both natural and artificial sequences, we investigate in detail the reasons of the appearance of the claimed ‘linguistic’ features and the associated differences between coding and noncoding DNAs. The presented results show quantitatively that the ‘linguistic’ tests failed to reveal any new biological information in (noncoding or coding) DNA.Keywords
This publication has 10 references indexed in Scilit:
- Explaining "linguistic features" of noncoding DNA.1996
- Explaining "Linguistic Features" of Noncoding DNAScience, 1996
- Noncoding DNA, Zipf's Law, and LanguageScience, 1995
- Linguistic Features of Noncoding DNA SequencesPhysical Review Letters, 1994
- Hints of a Language in Junk DNAScience, 1994
- A Quantitative Test of Long‐range Correlations and Compositional Fluctuations in DNA SequencesEuropean Journal of Biochemistry, 1994
- Pseudorandom number generator for massively parallel molecular-dynamics simulationsPhysical Review E, 1994
- Variations in base pair composition and associated long-range correlations in DNA sequences — computer simulation resultsBiochimica et Biophysica Acta (BBA) - Gene Structure and Expression, 1994
- Biological origins of long-range correlations and compositional variations in DNANucleic Acids Research, 1993
- Patchiness and Correlations in DNA SequencesScience, 1993