The Rate of Convergence of the Mean Length of the Longest Common Subsequence
Open Access
- 1 November 1994
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Applied Probability
- Vol. 4 (4) , 1074-1082
- https://doi.org/10.1214/aoap/1177004903
Abstract
Given two i.i.d. sequences of $n$ letters from a finite alphabet, one can consider the length $L_n$ of the longest sequence which is a subsequence of both the given sequences. It is known that $EL_n$ grows like $\gamma n$ for some $\gamma \in \lbrack 0, 1\rbrack$. Here it is shown that $\gamma n \geq EL_n \geq \gamma n - C(n \log n)^{1/2}$ for an explicit numerical constant $C$ which does not depend on the distribution of the letters. In simulations with $n = 100,000, EL_n/n$ can be determined from $k$ such trials with 95% confidence to within $0.0055/\sqrt k$, and the results here show that $\gamma$ can then be determined with 95% confidence to within $0.0225 + 0.0055/\sqrt k$, for an arbitrary letter distribution.
Keywords
This publication has 0 references indexed in Scilit: