Upper bounds on the probability of sequences emitted by finite-state sources and on the redundancy of the Lempel-Ziv algorithm
- 1 January 1992
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Information Theory
- Vol. 38 (1) , 66-72
- https://doi.org/10.1109/18.108250
Abstract
An upper bound on the probability of a sequence drawn from a finite-state source is derived. The bound is given in terms of the number of phrases obtained by parsing the sequence according to the Lempel-Ziv (L-Z) incremental parsing rule, and is universal in the sense that it does not depend on the statistical parameters that characterize the source. This bound is used to derive an upper bound on the redundance of the L-Z universal data compression algorithm applied to finite-state sources, that depends on the length N of the sequence, on the number K of states of the source, and, eventually, on the source entropy. A variation of the L-Z algorithm is presented, and an upper bound on its redundancy is derived for finite-state sources. A method to derive tighter implicit upper bounds on the redundancy of both algorithms is also given, and it is shown that for the proposed variation this bound is smaller than for the original L-Z algorithm, or every value of N and KKeywords
This publication has 5 references indexed in Scilit:
- Comparison between universal data-compression algorithms applied to Markov sourcesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Compression, Tests for Randomness and Estimating the Statistical Model of an Individual SequencePublished by Springer Nature ,1990
- Complexity of strings in the class of Markov sourcesIEEE Transactions on Information Theory, 1986
- Compression of individual sequences via variable-rate codingIEEE Transactions on Information Theory, 1978
- On the Complexity of Finite SequencesIEEE Transactions on Information Theory, 1976