Optimising Web queries using document type definitions
- 1 November 1999
- conference paper
- Published by Association for Computing Machinery (ACM)
Abstract
A document type definition (DTD) D defines the structure of elements permitted in any web document valid with respect to D. From a given DTD D we show how to derive a number of simple structural constraints which are implied by D. Using a relational abstraction of web databases, we consider a class of conjunctive queries which retrieve elements from web documents stored in a database D. For simplicity, we assume that all documents in D are valid with respect to the same DTDD. The main contribution of the paper is the use of the constraints derived from D to optimise conjunctive queries on D by removing redundant conjuncts. The relational abstraction allows us to show that the constraints derived from a DTD are equivalent to tuple-generating and equality-generating dependencies which hold on D. Having done so, we can use the chase algorithm to show equivalence between a query and its reduced form.Keywords
This publication has 4 references indexed in Scilit:
- Query rewriting for semistructured dataPublished by Association for Computing Machinery (ACM) ,1999
- Path constraints on semistructured and structured dataPublished by Association for Computing Machinery (ACM) ,1998
- Regular path queries with constraintsPublished by Association for Computing Machinery (ACM) ,1997
- Querying documents in object databasesInternational Journal on Digital Libraries, 1997