Document image decoding using Markov source models
- 1 June 1994
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 16 (6) , 602-617
- https://doi.org/10.1109/34.295905
Abstract
Document image decoding (DID) is a communication theory approach to document image recognition. In DID, a document recognition problem is viewed as consisting of three elements: an image generator, a noisy channel and an image decoder. A document image generator is a Markov source (stochastic finite-state automaton) that combines a message source with an imager. The message source produces a string of symbols, or text, that contains the information to be transmitted. The imager is modeled as a finite-state transducer that converts the 1D message string into an ideal 2D bitmap. The channel transforms the ideal image into a noisy observed image. The decoder estimates the message, given the observed image, by finding the a posteriori most probable path through the combined source and channel models using a Viterbi-like dynamic programming algorithm. The proposed approach is illustrated on the problem of decoding scanned telephone yellow pages to extract names and numbers from the listings. A finite-state model for yellow page columns was constructed and used to decode a database of scanned column images containing about 1100 individual listings.Keywords
This publication has 10 references indexed in Scilit:
- Hidden Markov models for character recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Off-line handwritten word recognition using a hidden Markov model type stochastic networkPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1994
- Least-squares font metric estimation from imagesIEEE Transactions on Image Processing, 1993
- Word spotting in scanned images using hidden Markov modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1993
- Connected and degraded text recognition using planar hidden Markov modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1993
- Lindenmayer Systems, Fractals, and PlantsPublished by Springer Nature ,1989
- A Maximum Likelihood Approach to Continuous Speech RecognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1983
- Document Analysis SystemIBM Journal of Research and Development, 1982
- Continuous speech recognition by statistical methodsProceedings of the IEEE, 1976
- The Organization of Computations for Uniform Recurrence EquationsJournal of the ACM, 1967