Incorporating language syntax in visual text recognition with a statistical model
- 1 January 1996
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 18 (12), 1251-1255
- https://doi.org/10.1109/34.546261
Abstract
The use of a statistical language model to improve the performance of an algorithm for recognizing digital images of handwritten or machine-printed text is discussed. A word recognition algorithm first determines a set of words (called a neighborhood) from a lexicon that are visually similar to each input word image. Syntactic classifications for the words and the transition probabilities between those classifications are input to the Viterbi algorithm. The Viterbi algorithm determines the sequence of syntactic classes (the states of an underlying Markov process) for each sentence that has the maximum a posteriori probability, given the observed neighborhoods. The performance of the word recognition algorithm is improved by removing words from neighborhoods with classes that are not included on the estimated state sequence. An experimental application is demonstrated with a neighborhood generation algorithm that produces a number of guesses about the identity of each word in a running text. The use of zero-, first-, and second-order transition probabilities and different levels of noise in estimating the neighborhood are explored.
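The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `viterbi_filter`, the tag set, the toy neighborhoods, and all probabilities are hypothetical. It uses first-order transitions only, and it assumes that the observation score of a syntactic class at a position is the sum of the visual scores of the candidate words carrying that class, a detail the abstract does not specify.

```python
import math


def _log(p):
    """Log of a probability, mapping zero to negative infinity."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi_filter(neighborhoods, start_p, trans_p):
    """Pick the most probable class sequence and prune each neighborhood.

    neighborhoods: one list per word image of (word, tag, visual_score).
    start_p: dict tag -> probability of starting a sentence with that tag.
    trans_p: dict (prev_tag, tag) -> first-order transition probability.
    """
    # Assumed observation model: sum the visual scores per tag.
    emit = []
    for hood in neighborhoods:
        scores = {}
        for _word, tag, score in hood:
            scores[tag] = scores.get(tag, 0.0) + score
        emit.append(scores)

    # best[t][tag] = (log score of best path ending in tag, previous tag)
    best = [{tag: (_log(start_p.get(tag, 0.0)) + _log(s), None)
             for tag, s in emit[0].items()}]
    for t in range(1, len(emit)):
        column = {}
        for tag, s in emit[t].items():
            prev_tag, prev_score = max(
                ((p, best[t - 1][p][0] + _log(trans_p.get((p, tag), 0.0)))
                 for p in best[t - 1]),
                key=lambda pair: pair[1],
            )
            column[tag] = (prev_score + _log(s), prev_tag)
        best.append(column)

    # Backtrack from the best final state.
    tag = max(best[-1], key=lambda k: best[-1][k][0])
    path = [tag]
    for t in range(len(best) - 1, 0, -1):
        tag = best[t][tag][1]
        path.append(tag)
    path.reverse()

    # Keep only the words whose class lies on the estimated sequence.
    kept = [[w for w, tg, _ in hood if tg == path[i]]
            for i, hood in enumerate(neighborhoods)]
    return path, kept


# Toy data (entirely made up for illustration).
neighborhoods = [
    [("the", "DET", 0.6), ("she", "PRON", 0.4)],
    [("dog", "NOUN", 0.5), ("dig", "VERB", 0.5)],
    [("barks", "VERB", 0.7), ("banks", "NOUN", 0.3)],
]
start_p = {"DET": 0.6, "PRON": 0.3, "NOUN": 0.05, "VERB": 0.05}
trans_p = {
    ("DET", "NOUN"): 0.9, ("DET", "VERB"): 0.1,
    ("PRON", "VERB"): 0.8, ("PRON", "NOUN"): 0.2,
    ("NOUN", "VERB"): 0.7, ("NOUN", "NOUN"): 0.3,
    ("VERB", "NOUN"): 0.6, ("VERB", "VERB"): 0.4,
}

path, kept = viterbi_filter(neighborhoods, start_p, trans_p)
print(path)  # ['DET', 'NOUN', 'VERB']
print(kept)  # [['the'], ['dog'], ['barks']]
```

In this toy case each neighborhood shrinks to a single word. In general several words sharing the winning class would remain, so the step narrows the candidates for the word recognizer rather than making the final decision.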
This publication has 10 references indexed in Scilit:
- Generalized Viterbi algorithms for error detection with convolutional codes. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- A hidden Markov model for language syntax in text recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- The N-best algorithms: an efficient and exact procedure for finding the N most likely sentence hypotheses. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Proper noun detection in document images. Pattern Recognition, 1994
- A computational model for recognition of multifont word images. Machine Vision and Applications, 1992
- Self-Organized Language Modeling for Speech Recognition. Published by Elsevier, 1990
- A tree-based statistical language model for natural language speech recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1989
- Hypothesis Generation in a Computational Model for Visual Word Recognition. IEEE Expert, 1986
- A Maximum Likelihood Approach to Continuous Speech Recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 1983
- Experiments in Text Recognition with the Modified Viterbi Algorithm. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1979