Character string extraction from a color document
- 1 January 1999
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
An algorithm for the extraction of character strings from a color document is proposed. We first divide the full color image of a document into several representative color images. Then, character strings are nominated from each binary image by using multi-stage relaxation. However, the nominated strings are not always characters. Therefore, when all nominated strings of all images are superimposed, some strings overlap each other. So, we selected the appropriate strings from them using a likelihood of a character string and two binds of conflict resolution. Finally, we show the results of the experiments and discuss some problems of character string extraction from a color document.Keywords
This publication has 6 references indexed in Scilit:
- Recognition of character strings from color urban map images on the basis of validation mechanismPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Character string extraction by multi-stage relaxationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- AUTOMATIC TEXT LOCATION IN IMAGES AND VIDEO FRAMESPattern Recognition, 1998
- The document spectrum for page layout analysisPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1993
- Historical review of OCR research and developmentProceedings of the IEEE, 1992
- A robust algorithm for text string separation from mixed text/graphics imagesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1988