The statistical distribution of nucleic acid similarities
Open Access
- 1 January 1985
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 13 (2) , 645-656
- https://doi.org/10.1093/nar/13.2.645
Abstract
All pairs of a large set of known vertebrate DNA sequences were searched by computer for most similar segments. Analysis of this data shows that the computed similarity scores are distributed proportionally to the logarithm of the product of the lengths of the sequences involved. This distribution is closely related to recent results of Erdos and others on the longest run of heads in coin tossing. A simple rule is derived for determination of statistical significance of the similarity scores and to assist in relating statistical and biological significance.Keywords
This publication has 29 references indexed in Scilit:
- Sequence banks: Searching for sequence similaritiesNature, 1983
- Low molecular weight RNAs transcribed in vitro by RNA polymerase III from Alu-type dispersed repeats in Chinese hamster DNA are also found in vivo.Proceedings of the National Academy of Sciences, 1981
- Reiterated sequences within the intron of an immediate-early gene of herpes simplex virus type 1Nucleic Acids Research, 1981
- Molecular cloning and characterization of cDNA sequences coding for rat relaxinNature, 1981
- Structural analysis of interspersed repetitive polymerase III transcription units in human DNA.1981
- The structure of a human α-globin pseudogene and its relationship to α-globin gene duplicationCell, 1980
- The ovalbumin gene family: Structure of the X gene and evolution of duplicated split genesCell, 1980
- Isolation and sequence of the gene for actin in Saccharomyces cerevisiae.Proceedings of the National Academy of Sciences, 1980
- Rearrangement of immunoglobulin gamma 1-chain gene and mechanism for heavy-chain class switch.Proceedings of the National Academy of Sciences, 1980
- Nucleotide sequence and amplification in bacteria of structural gene for rat growth hormoneNature, 1977