Document image decoding using Markov source models

1 January 1993

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 5, 85-88 vol.5
https://doi.org/10.1109/icassp.1993.319753

Abstract

The authors describe a communication theory approach to document image reconstruction, patterned after the use of hidden Markov models in speech recognition. A document recognition problem is viewed as consisting of three elements-an image generator, a noisy channel, and an image decoder. A document image generator is a Markov source which combines a message source with an imager. The message source produces a string of symbols which contains the information to be transmitted. The imager is modeled as a finite-state transducer, which converts the message into an ideal bitmap. The channel transforms the ideal image into a noisy observed image. The decoder estimates the message from the observed image by finding the a posteriori most probable path through the combined source and channel models using a Viterbi-like algorithm. Application of the proposed method to decoding telephone yellow pages is described.<>

Keywords

This publication has 6 references indexed in Scilit:

Hidden Markov models for character recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Connected and degraded text recognition using hidden Markov model
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Word spotting in scanned images using hidden Markov models
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1993
Parsing 2-Dimensional Language
Published by Springer Nature ,1991
Recognition of handwritten word: First and second order hidden Markov model based approach
Pattern Recognition, 1989
A Maximum Likelihood Approach to Continuous Speech Recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1983