Document image decoding using Markov source models

1 June 1994

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 16 (6) , 602-617
https://doi.org/10.1109/34.295905

Abstract

Document image decoding (DID) is a communication theory approach to document image recognition. In DID, a document recognition problem is viewed as consisting of three elements: an image generator, a noisy channel and an image decoder. A document image generator is a Markov source (stochastic finite-state automaton) that combines a message source with an imager. The message source produces a string of symbols, or text, that contains the information to be transmitted. The imager is modeled as a finite-state transducer that converts the 1D message string into an ideal 2D bitmap. The channel transforms the ideal image into a noisy observed image. The decoder estimates the message, given the observed image, by finding the a posteriori most probable path through the combined source and channel models using a Viterbi-like dynamic programming algorithm. The proposed approach is illustrated on the problem of decoding scanned telephone yellow pages to extract names and numbers from the listings. A finite-state model for yellow page columns was constructed and used to decode a database of scanned column images containing about 1100 individual listings.

Keywords

This publication has 10 references indexed in Scilit:

Hidden Markov models for character recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Off-line handwritten word recognition using a hidden Markov model type stochastic network
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1994
Least-squares font metric estimation from images
IEEE Transactions on Image Processing, 1993
Word spotting in scanned images using hidden Markov models
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1993
Connected and degraded text recognition using planar hidden Markov models
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1993
Lindenmayer Systems, Fractals, and Plants
Published by Springer Nature ,1989
A Maximum Likelihood Approach to Continuous Speech Recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1983
Document Analysis System
IBM Journal of Research and Development, 1982
Continuous speech recognition by statistical methods
Proceedings of the IEEE, 1976
The Organization of Computations for Uniform Recurrence Equations
Journal of the ACM, 1967