Stochastic scrabble: large deviations for sequences with scores
- 1 March 1988
- journal article
- Published by Cambridge University Press (CUP) in Journal of Applied Probability
- Vol. 25 (1) , 106-119
- https://doi.org/10.2307/3214238
Abstract
A derivation of a law of large numbers for the highest-scoring matching subsequence is given. Let Xk, Yk be i.i.d. q=(q(i))i∊S letters from a finite alphabet S and v=(v(i))i∊S be a sequence of non-negative real numbers assigned to the letters of S. Using a scoring system similar to that of the game Scrabble, the score of a word w=i1 · ·· im is defined to be V(w)=v(i1) + · ·· + v(im). Let Vn denote the value of the highest-scoring matching contiguous subsequence between X1X2 · ·· Xn and Y1Y2· ·· Yn. In this paper, we show that Vn/K log(n) → 1 a.s. where K ≡ K(q,v). The method employed here involves ‘stuttering’ the letters to construct a Markov chain and applying previous results for the length of the longest matching subsequence. An explicit form for β ∊Pr(S), where β (i) denotes the proportion of letter i found in the highest-scoring word, is given. A similar treatment for Markov chains is also included.Implicit in these results is a large-deviation result for the additive functional, H ≡ Σn < τv(Xn), for a Markov chain stopped at the hitting time τ of some state. We give this large deviation result explicitly, for Markov chains in discrete time and in continuous time.Keywords
This publication has 6 references indexed in Scilit:
- Two Moments Suffice for Poisson Approximations: The Chen-Stein MethodThe Annals of Probability, 1989
- An Extreme Value Theory for Sequence MatchingThe Annals of Statistics, 1986
- Critical Phenomena in Sequence MatchingThe Annals of Probability, 1985
- An Erdös-Rényi law with shiftsAdvances in Mathematics, 1985
- General methods of sequence comparisonBulletin of Mathematical Biology, 1984
- On a new law of large numbersJournal d'Analyse Mathématique, 1970