Recognition of Printed Chinese Characters

1 February 1966

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Electronic Computers

Vol. EC-15 (1) , 91-101
https://doi.org/10.1109/pgec.1966.264379

Abstract

The problem of recognizing a large alphabet (1000 different characters) is approached using a two stage process. In the first stage of design, the data is partitioned into groups of similar characters by means of heuristic and iterative algorithms. In the second stage, peephole templates are generated for each character in such a way as to guarantee discrimination against other characters in the same similarity class. Recognition is preceded by establishing an order of search through the groups with a relatively small number of ``group masks.'' The character is then identified by means of the ``individual masks.'' through a threshold criterion. The effects on the error and reject rates of varying the several parameters in the design and test procedure are described on the basis of computer simulation experiments on a 20 000 character data set. An error rate of 1 percent with 7 percent rejects, is obtained on new data.

Keywords

This publication has 3 references indexed in Scilit:

An Optical Character Scanner
Optical Engineering, 1964
Machine Translation of Chinese
Scientific American, 1963
A “Logical Pattern” Recognition Program
IBM Journal of Research and Development, 1962