WebOQL: restructuring documents, databases and Webs
- 27 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The widespread use of the Web has given rise to several new data management problems, such as extracting data from Web pages and making databases accessible from Web browsers, and has renewed interest in problems that had appeared before in other contexts, such as querying graphs, semistructured data and structured documents. Several systems and languages have been proposed for solving each of these Web data management problems, but none of these systems addresses all the problems from a unified perspective. Many of these problems essentially amount to data restructuring: we have information represented according to a certain structure and we want to construct another representation of (part of) it using a different structure. We present the WebOQL system, which supports a general class of data restructuring operations in the context of the Web. WebOQL synthesizes ideas from query languages for the Web, for semistructured data and for Website restructuring.