Incorporating language syntax in visual text recognition with a statistical model

1 January 1996

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 18 (12) , 1251-1255
https://doi.org/10.1109/34.546261

Abstract

The use of a statistical language model to improve the performance of an algorithm for recognizing digital images of handwritten or machine-printed text is discussed. A word recognition algorithm first determines a set of words (called a neighborhood) from a lexicon that are visually similar to each input word image. Syntactic classifications for the words and the transition probabilities between those classifications are input to the Viterbi algorithm. The Viterbi algorithm determines the sequence of syntactic classes (the states of an underlying Markov process) for each sentence that have the maximum a posteriori probability, given the observed neighborhoods. The performance of the word recognition algorithm is improved by removing words from neighborhoods with classes that are not included on the estimated state sequence.An experimental application is demonstrated with a neighborhood generation algorithm that produces a number of guesses about the identity of each word in a running text. The use of zero, first and second order transition probabilities and different levels of noise in estimating the neighborhood are explored.

Keywords

This publication has 10 references indexed in Scilit:

Generalized Viterbi algorithms for error detection with convolutional codes
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
A hidden Markov model for language syntax in text recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
The N-best algorithms: an efficient and exact procedure for finding the N most likely sentence hypotheses
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Proper noun detection in document images
Pattern Recognition, 1994
A computational model for recognition of multifont word images
Machine Vision and Applications, 1992
SELF-ORGANIZED LANGUAGE MODELING FOR SPEECH RECOGNITION
Published by Elsevier ,1990
A tree-based statistical language model for natural language speech recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1989
Hypothesis Generation in a Computational Model for Visual Word Recognition
IEEE Expert, 1986
A Maximum Likelihood Approach to Continuous Speech Recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1983
Experiments in Text Recognition with the Modified Viterbi Algorithm
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1979