A knowledge-based approach to the layout analysis
- 1 January 1995
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1, 466-471
- https://doi.org/10.1109/icdar.1995.599037
Abstract
In this paper, we present a hybrid approach to document analysis in which the document image is first segmented by means of a top-down technique, and the resulting basic blocks are then grouped bottom-up to form complex layout components. This latter process, called layout analysis, exploits only generic knowledge of typesetting conventions. Such knowledge is independent of the particular class of documents being processed and proves valuable for a wide range of documents. Preliminary results from the layout analysis system LEX (Layout EXpert) show the methodological validity of this approach.
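The bottom-up grouping step described in the abstract can be illustrated with a minimal sketch. The abstract does not give LEX's actual rules, so everything here is an assumption made for illustration: the `Block` type, the `same_line` rule (two blocks belong together if they overlap vertically and are separated by a small horizontal gap), and the `max_gap` threshold are hypothetical and stand in for one generic typesetting convention, not the authors' method.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """Bounding box of a basic block (hypothetical representation)."""
    x0: int
    y0: int
    x1: int
    y1: int


def merge(a: Block, b: Block) -> Block:
    """Return the smallest box enclosing both blocks."""
    return Block(min(a.x0, b.x0), min(a.y0, b.y0),
                 max(a.x1, b.x1), max(a.y1, b.y1))


def same_line(a: Block, b: Block, max_gap: int) -> bool:
    """Assumed generic rule: vertical overlap plus a small horizontal gap."""
    v_overlap = min(a.y1, b.y1) - max(a.y0, b.y0)
    h_gap = max(a.x0, b.x0) - min(a.x1, b.x1)
    return v_overlap > 0 and h_gap <= max_gap


def group_blocks(blocks: list[Block], max_gap: int = 10) -> list[Block]:
    """Repeatedly merge any pair of blocks satisfying the rule until none remain."""
    groups = list(blocks)
    merged = True
    while merged:
        merged = False
        for i in range(len(groups)):
            for j in range(i + 1, len(groups)):
                if same_line(groups[i], groups[j], max_gap):
                    groups[i] = merge(groups[i], groups[j])
                    del groups[j]
                    merged = True
                    break
            if merged:
                break
    # Report groups in reading order: top to bottom, then left to right.
    return sorted(groups, key=lambda b: (b.y0, b.x0))


if __name__ == "__main__":
    basic_blocks = [Block(0, 0, 10, 10), Block(15, 1, 30, 9), Block(0, 40, 20, 50)]
    for group in group_blocks(basic_blocks):
        print(group)
```

In this sketch the first two blocks are merged into one line-level component, while the third stays separate because it shares no vertical extent with them. The rule refers only to geometry, which is the sense in which such knowledge does not depend on the class of document.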