Automated acquisition of rules for document understanding
- 30 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
A study on the possibility of adopting a supervised inductive learning approach to the problem of document understanding is presented. A representation language used to describe a page layout is introduced and the opportunity of extending such a language by means of intentionally defined predicates is discussed. Experimental results obtained by using a well-known learning system, FOCL, are presented. They confirm the exigency of redefining the problem of document understanding in terms of a new strategy of supervised inductive learning, called contextual learning. Some experiments in which a dependence hierarchy between concepts is defined show that contextual rules increase predictive accuracy and decrease learning time for labeling problems like document understanding.<>Keywords
This publication has 5 references indexed in Scilit:
- Understanding multi-articled documentsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- The utility of knowledge in inductive learningMachine Learning, 1992
- Learning logical definitions from relationsMachine Learning, 1990
- An experimental page layout recognition system for office document automatic classification: an integrated approach for inductive generalizationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1990
- The Consistent Labeling Problem: Part IIEEE Transactions on Pattern Analysis and Machine Intelligence, 1979