Recognition of Printed Chinese Characters
- 1 February 1966
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Electronic Computers
- Vol. EC-15 (1) , 91-101
- https://doi.org/10.1109/pgec.1966.264379
Abstract
The problem of recognizing a large alphabet (1000 different characters) is approached using a two stage process. In the first stage of design, the data is partitioned into groups of similar characters by means of heuristic and iterative algorithms. In the second stage, peephole templates are generated for each character in such a way as to guarantee discrimination against other characters in the same similarity class. Recognition is preceded by establishing an order of search through the groups with a relatively small number of ``group masks.'' The character is then identified by means of the ``individual masks.'' through a threshold criterion. The effects on the error and reject rates of varying the several parameters in the design and test procedure are described on the basis of computer simulation experiments on a 20 000 character data set. An error rate of 1 percent with 7 percent rejects, is obtained on new data.Keywords
This publication has 3 references indexed in Scilit:
- An Optical Character ScannerOptical Engineering, 1964
- Machine Translation of ChineseScientific American, 1963
- A “Logical Pattern” Recognition ProgramIBM Journal of Research and Development, 1962