DNA segmentation as a model selection process
- 22 April 2001
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 204-210
- https://doi.org/10.1145/369133.369202
Abstract
Previous divide-and-conquer segmentation analyses of DNA sequences do not provide a satisfactory stopping criterion for the recursion. This paper proposes that segmentation be considered as a model selection process. Using the tools in model selection, a limit for the stopping criterion on the relaxed end can be determined. The Bayesian information criterion, in particular, provides a much more stringent stopping criterion than what is currently used. Such a stringent criterion can be used to delineate larger DNA domains. A relationship between the stopping criterion and the average domain size is empirically determined, which may aid in the determination of isochore borders.
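The idea of recursive segmentation with a model-selection stopping rule can be illustrated with a minimal sketch. This is not the paper's code. It assumes a simple multinomial model of base composition per segment and accepts a split only when twice the log-likelihood gain exceeds `extra_params * ln(N)`, a BIC-style penalty. The function names and the default `extra_params=4` (three free base frequencies for the new segment plus one cut position) are assumptions made for illustration.

```python
import math
from collections import Counter


def log_likelihood(counts):
    """Maximised multinomial log-likelihood (in nats) for the given base counts."""
    n = sum(counts.values())
    return sum(c * math.log(c / n) for c in counts.values() if c > 0)


def best_split(seq):
    """Return (gain, cut) for the cut that maximises
    2 * (logL(left) + logL(right) - logL(whole)); cut is None if no cut helps."""
    total = Counter(seq)
    whole = log_likelihood(total)
    left = Counter()
    best_gain, best_cut = 0.0, None
    for i in range(1, len(seq)):
        left[seq[i - 1]] += 1
        right = total - left  # counts of the bases to the right of the cut
        gain = 2.0 * (log_likelihood(left) + log_likelihood(right) - whole)
        if gain > best_gain:
            best_gain, best_cut = gain, i
    return best_gain, best_cut


def segment(seq, offset=0, extra_params=4):
    """Recursively split seq; return the sorted list of accepted cut positions.

    A split is kept only if the likelihood gain beats the BIC-style penalty
    extra_params * ln(N), where N is the length of the segment being split.
    extra_params=4 is an assumed value, not taken from the source text.
    """
    gain, cut = best_split(seq)
    if cut is None or gain <= extra_params * math.log(len(seq)):
        return []  # stopping criterion: the one-segment model is preferred
    return (
        segment(seq[:cut], offset, extra_params)
        + [offset + cut]
        + segment(seq[cut:], offset + cut, extra_params)
    )


# Two compositionally distinct halves yield one border; a periodic sequence yields none.
print(segment("A" * 200 + "G" * 200))
print(segment("ACGT" * 50))
```

Raising `extra_params` makes the criterion more stringent, so fewer and larger domains survive, which is the qualitative behaviour the abstract describes.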