Rewriting of regular expressions and regular path queries
- 1 May 1999
- proceedings article
- Published by Association for Computing Machinery (ACM)
- Vol. 64 (3) , 194-204
- https://doi.org/10.1145/303976.303996
Abstract
Recent work on semi-structured data has revitalized the interest in path queries, i.e., queries that ask for all pairs of objects in the database that are connected by a path conforming to a certain specification, in partic- ular to a regular expression. Also, in semi-structured data, as well as in data integration, data warehousing, and query optimization, the problem of query rewriting using views is receiving much attention: Given a query and a collection of views, generate a new query which uses the views and provides the answer to the original one. In this paper we address the problem of query rewrit- ing using views in the context of semi-structured data. We present a method for computing the rewriting of a regular expression E in terms of other regular expres- sions. The method computes the exact rewriting (the one that defines the same regular language as E) if it exists, or the rewriting that defines the maximal lan- guage contained in the one defined by E, otherwise. We present a complexity analysis of both the problem and the method, showing that the latter is essentially opti- mal. Finally, we illustrate how to exploit the method to rewrite regular path queries using views in semi- structured data. The complexity results established for the rewriting of regular expressions apply also to the case of regular path queries.Keywords
This publication has 23 references indexed in Scilit:
- Rewriting aggregate queries using viewsPublished by Association for Computing Machinery (ACM) ,1999
- Complexity of answering queries using materialized viewsPublished by Association for Computing Machinery (ACM) ,1998
- Path constraints on semistructured and structured dataPublished by Association for Computing Machinery (ACM) ,1998
- A query language for a Web-site management systemACM SIGMOD Record, 1997
- The Lorel query language for semistructured dataInternational Journal on Digital Libraries, 1997
- The GMAP: a versatile tool for physical data independenceThe VLDB Journal, 1996
- Query caching and optimization in distributed mediator systemsPublished by Association for Computing Machinery (ACM) ,1996
- GraphLogPublished by Association for Computing Machinery (ACM) ,1990
- Space-bounded reducibility among combinatorial problemsJournal of Computer and System Sciences, 1975
- Relationships between nondeterministic and deterministic tape complexitiesJournal of Computer and System Sciences, 1970