Species independence of mutual information in coding and noncoding DNA
- 1 May 2000
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review E
- Vol. 61 (5) , 5624-5629
- https://doi.org/10.1103/physreve.61.5624
Abstract
We explore whether there exist universal statistical patterns that differ between coding and noncoding DNA and can be found in all living organisms, regardless of their phylogenetic origin. We find that (i) the mutual information function I has a significantly different functional form in coding and noncoding DNA. We further find that (ii) the probability distributions of the average mutual information are significantly different in coding and noncoding DNA, while (iii) they are almost the same for organisms of all taxonomic classes. Surprisingly, we find that the average mutual information is capable of predicting coding regions as accurately as organism-specific coding measures.
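For readers unfamiliar with the quantity named in the abstract, the mutual information function of a symbolic sequence is commonly written as I(k) = Σ p_ab(k) log2[ p_ab(k) / (p_a p_b) ], where p_ab(k) is the probability of finding symbols a and b separated by k positions. The following is a minimal sketch of a plug-in estimate of that textbook formula. It is not the authors' implementation: the function name `mutual_information`, the base-2 logarithm, and the absence of any finite-sample correction are assumptions made for illustration, and the averaging procedure used in the paper is not reproduced here.

```python
from collections import Counter
from math import log2


def mutual_information(seq: str, k: int) -> float:
    """Plug-in estimate (in bits) of I(k) for symbols k positions apart.

    Hypothetical helper for illustration; no finite-size bias correction.
    """
    if k < 1 or k >= len(seq):
        raise ValueError("k must satisfy 1 <= k < len(seq)")

    # Joint counts of (symbol at position i, symbol at position i + k).
    pairs = Counter(zip(seq, seq[k:]))
    n = sum(pairs.values())  # number of pairs, equal to len(seq) - k

    # Marginal counts taken over the same n pairs, so the estimate
    # is a proper Kullback-Leibler divergence and never negative.
    left = Counter(seq[:-k])
    right = Counter(seq[k:])

    total = 0.0
    for (a, b), count in pairs.items():
        p_ab = count / n
        # p_ab / (p_a * p_b), written in terms of raw counts
        ratio = count * n / (left[a] * right[b])
        total += p_ab * log2(ratio)
    return total


if __name__ == "__main__":
    toy = "ACGT" * 50  # artificial periodic sequence, not real DNA
    for k in (1, 2, 3, 4):
        print(k, mutual_information(toy, k))
```

On a real sequence one would evaluate I(k) over a range of distances k and compare the resulting curves for annotated coding and noncoding segments; the toy string above only exercises the function.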