Automatic Annotation of Content-Rich HTML Documents: Structural and Semantic Analysis
- 1 January 2003
- book chapter
- Published by Springer Nature
Abstract
No abstract availableKeywords
This publication has 23 references indexed in Scilit:
- Extracting structured data from Web pagesPublished by Association for Computing Machinery (ACM) ,2003
- On deep annotationPublished by Association for Computing Machinery (ACM) ,2003
- SemTag and seekerPublished by Association for Computing Machinery (ACM) ,2003
- A fully automated object extraction system for the World Wide WebPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Authoring and annotation of web pages in CREAMPublished by Association for Computing Machinery (ACM) ,2002
- A flexible learning system for wrapping tables and lists in HTML documentsPublished by Association for Computing Machinery (ACM) ,2002
- XTRACTPublished by Association for Computing Machinery (ACM) ,2000
- Record-boundary discovery in Web documentsPublished by Association for Computing Machinery (ACM) ,1999
- Ontology-based extraction and structuring of information from data-rich unstructured documentsPublished by Association for Computing Machinery (ACM) ,1998
- Template-based wrappers in the TSIMMIS systemPublished by Association for Computing Machinery (ACM) ,1997