Automatic Annotation of Content-Rich HTML Documents: Structural and Semantic Analysis

No abstract available

This publication has 23 references indexed in Scilit:

Extracting structured data from Web pages
Published by Association for Computing Machinery (ACM) ,2003
On deep annotation
Published by Association for Computing Machinery (ACM) ,2003
SemTag and seeker
Published by Association for Computing Machinery (ACM) ,2003
A fully automated object extraction system for the World Wide Web
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Authoring and annotation of web pages in CREAM
Published by Association for Computing Machinery (ACM) ,2002
A flexible learning system for wrapping tables and lists in HTML documents
Published by Association for Computing Machinery (ACM) ,2002
XTRACT
Published by Association for Computing Machinery (ACM) ,2000
Record-boundary discovery in Web documents
Published by Association for Computing Machinery (ACM) ,1999
Ontology-based extraction and structuring of information from data-rich unstructured documents
Published by Association for Computing Machinery (ACM) ,1998
Template-based wrappers in the TSIMMIS system
Published by Association for Computing Machinery (ACM) ,1997