Context-tree modeling of observed symbolic dynamics
- 26 November 2002
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review E
- Vol. 66 (5) , 056209
- https://doi.org/10.1103/physreve.66.056209
Abstract
Modern techniques invented for data compression provide efficient automated algorithms for the modeling of the observed symbolic dynamics. We demonstrate the relationship between coding and modeling, motivating the well-known minimum description length (MDL) principle, and give concrete demonstrations of the “context-tree weighting” and “context-tree maximizing” algorithms. The predictive modeling technique obviates many of the technical difficulties traditionally associated with the correct MDL analyses. These symbolic models, representing the symbol generating process as a finite-state automaton with probabilistic emission probabilities, provide excellent and reliable entropy estimations. The resimulations of estimated tree models satisfying the MDL model-selection criterion are faithful to the original in a number of measures. The modeling suggests that the automated context-tree model construction could replace fixed-order word lengths in many traditional forms of empirical symbolic analysis of the data. We provide an explicit pseudocode for implementation of the context-tree weighting and maximizing algorithms, as well as for the conversion to an equivalent Markov chain.Keywords
This publication has 27 references indexed in Scilit:
- Symbolic time-series analysis of neural dataNeurocomputing, 2000
- On the role of pattern matching in information theoryIEEE Transactions on Information Theory, 1998
- Information and entropy in strange attractorsIEEE Transactions on Information Theory, 1989
- Universal coding, information, prediction, and estimationIEEE Transactions on Information Theory, 1984
- Irregularity: a fundamental property of the atmosphereTellus A: Dynamic Meteorology and Oceanography, 1984
- A universal data compression systemIEEE Transactions on Information Theory, 1983
- The performance of universal encodingIEEE Transactions on Information Theory, 1981
- Universal modeling and codingIEEE Transactions on Information Theory, 1981
- A universal algorithm for sequential data compressionIEEE Transactions on Information Theory, 1977
- Universal codeword sets and representations of the integersIEEE Transactions on Information Theory, 1975