The statistical analysis of direct repeats in nucleic acid sequences
- 1 March 1985
- journal article
- Published by Cambridge University Press (CUP) in Journal of Applied Probability
- Vol. 22 (1), 15-24
- https://doi.org/10.2307/3213744
Abstract
Sequence symmetries in DNA and RNA are being discovered at an increasing rate. Conjectures and hypotheses are being proposed for their possible structural and functional role in the nucleic acid. In this paper a probability model is studied which evaluates the probabilities of various repeats occurring by chance alone. Expressions are derived for the mean and variance of the statistics employed. The central limit theorem for dependent trials is used to obtain the asymptotic distributions. An indication is given of how to use the model to search for various gene amplification events in the evolutionary history of the sequences.
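The abstract does not state the statistics the paper uses, so the following is only a minimal illustrative sketch of the general idea of comparing an observed repeat count with its chance expectation. It assumes a simplified null model (independent, equally likely bases) and a simplified statistic (the number of position pairs whose length-k words are identical). The function names `count_repeat_pairs`, `expected_repeat_pairs`, and `simulate_null` are hypothetical and are not taken from the paper.

```python
import random
import statistics
from collections import Counter


def count_repeat_pairs(seq: str, k: int) -> int:
    """Count pairs of start positions i < j whose length-k words are identical."""
    kmers = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # A word seen c times contributes c choose 2 matching pairs.
    return sum(c * (c - 1) // 2 for c in kmers.values())


def expected_repeat_pairs(n: int, k: int, alphabet_size: int = 4) -> float:
    """Expected pair count under the ASSUMED null: i.i.d. uniform bases.

    With uniform bases every pair of start positions (overlapping or not)
    matches with probability alphabet_size ** -k, so the mean follows by
    linearity of expectation.
    """
    m = n - k + 1  # number of length-k windows
    if m < 2:
        return 0.0
    return m * (m - 1) / 2 * alphabet_size ** (-k)


def simulate_null(n: int, k: int, trials: int = 200, seed: int = 0):
    """Monte Carlo estimate of the mean and sample variance under the same null."""
    rng = random.Random(seed)
    counts = [
        count_repeat_pairs("".join(rng.choices("ACGT", k=n)), k)
        for _ in range(trials)
    ]
    return statistics.mean(counts), statistics.variance(counts)
```

Under these assumptions, an observed count for a real sequence could be standardized with the simulated mean and variance. The match indicators for different position pairs are dependent, which is why a central limit theorem for dependent trials, rather than the classical one, would be the relevant tool for a normal approximation.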