Statistical syntactic methods for high-performance OCR

1 January 1996

journal article
Published by Institution of Engineering and Technology (IET) in IEE Proceedings - Vision, Image, and Signal Processing

Vol. 143 (1) , 23-30
https://doi.org/10.1049/ip-vis:19960253

Abstract

The paper describes a new method for language modelling and reports its application to handwritten OCR. Images of characters are first chain-coded to convert them to strings. A novel language modelling method is then applied to build a statistical model for strings of each class. The language modelling method is based on a probabilistic version of an n-tuple classifier which is scanned along the entire string for both training and recognition. This method is extremely fast and robust, and concentrates all the computational effort on the portion of the image where the information is, i.e. the edges left by the trace of the pen. Results on the CEDAR handwritten digit database show the new method to be almost as accurate as the best methods reported so far, while offering a significant speed advantage.

Keywords

This publication has 9 references indexed in Scilit:

A comparison of syntactic and statistical techniques for off-line OCR
Published by Springer Nature ,1994
Inference of k-testable languages in the strict sense and application to syntactic pattern recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1990
The estimation of stochastic context-free grammars using the Inside-Outside algorithm
Computer Speech & Language, 1990
Modelling (Sub)String Length Based Constraints through a Grammatical Inference Method
Published by Springer Nature ,1987
An introduction to hidden Markov models
IEEE ASSP Magazine, 1986
The development of an experimental discrete dictation recognizer
Proceedings of the IEEE, 1985
WISARD·a radical step forward in image recognition
Sensor Review, 1984
Trainable grammars for speech recognition
The Journal of the Acoustical Society of America, 1979
Guide to pattern recognition using random-access memories
IEE Journal on Computers and Digital Techniques, 1979