A declarative language for querying and restructuring the Web
- 23 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
World Wide Web is a hypertext based, distributed information system that provides access to vast amounts of information in the Internet. A fundamental problem with the Web is the difficulty of retrieving specific information of interest to the user, from the enormous number of resources that are available. We develop a simple logic called WebLog that is capable of retrieving information from HTML (Hypertext Markup Language) documents in the Web. WebLog is inspired by SchemaLog, a logic for multidatabase interoperability. We demonstrate the suitability of WebLog for: querying and restructuring Web information; exploiting partial knowledge users might have on the information being queried; and dealing with the dynamic nature of information in the Web. We illustrate the simplicity and power of WebLog using a variety of applications involving real life information in the Web.Keywords
This publication has 9 references indexed in Scilit:
- Information gathering in the World-Wide WebACM Transactions on Database Systems, 1998
- Hypertext Markup Language - 2.0Published by RFC Editor ,1995
- Logical foundations of object-oriented and frame-based languagesJournal of the ACM, 1995
- A database interface for file updatePublished by Association for Computing Machinery (ACM) ,1995
- HyperFile: A data and query model for documentsThe VLDB Journal, 1995
- GENVL and WWWW: Tools for taming the WebComputer Networks and ISDN Systems, 1994
- From structured documents to novel query facilitiesPublished by Association for Computing Machinery (ACM) ,1994
- On the logical foundations of schema integration and evolution in heterogeneous database systemsPublished by Springer Nature ,1993
- EXPRESSACM Transactions on Database Systems, 1977