Physical database design for data warehouses
- 22 November 2002
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 277-288
- https://doi.org/10.1109/icde.1997.581802
Abstract
Data warehouses collect copies of information from remote sources into a single database. Since the remote data is cached at the warehouse, it appears as local relations to the users of the warehouse. To improve query response time, the warehouse administrator will often materialize views defined on the local relations to support common or complicated queries. Unfortunately, the requirement to keep the views consistent with the local relations creates additional overhead when the remote sources change. The warehouse is often kept only loosely consistent with the sources: it is periodically refreshed with changes sent from the source. When this happens, the warehouse is taken off-line until the local relations and materialized views can be updated. Clearly, the users would prefer as little down time as possible. Often the down time can be reduced by adding carefully selected materialized views or indexes to the physical schema. This paper studies how to select the sets of supporting views and of indexes to materialize to minimize the down time. We call this the view index selection (VIS) problem. We present an A* search based solution to the problem as well as rules of thumb. We also perform additional experiments to understand the space-time tradeoff as it applies to data warehouses.
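The abstract does not give the paper's cost model or search formulation, so the following is only a minimal sketch of how an A* search over view and index choices could look under invented assumptions. The function name `select_structures`, the candidate names, and all cost numbers are hypothetical. Each candidate is assumed to change total refresh cost by a fixed net amount (its own upkeep minus the maintenance work it saves), and an index on a supporting view is assumed to require that view.

```python
import heapq
from itertools import count


def select_structures(base_cost, candidates):
    """Toy A* search for a set of supporting views/indexes (hypothetical model).

    candidates: list of (name, net_change, requires) tuples, ordered so that
    any required structure appears before the structure that needs it.
    net_change is upkeep minus saving; a negative value lowers refresh cost.
    Returns (best_cost, chosen_names).
    """
    n = len(candidates)

    # suffix_bound[i] is an optimistic bound on the cost change still
    # obtainable from candidates i..n-1: assume every helpful one is taken.
    suffix_bound = [0.0] * (n + 1)
    for i in range(n - 1, -1, -1):
        suffix_bound[i] = suffix_bound[i + 1] + min(0.0, candidates[i][1])

    tie = count()  # tie-breaker so the heap never compares the tuples of names
    # Heap entries: (f = g + h, tie, next candidate index, g, chosen names)
    frontier = [(base_cost + suffix_bound[0], next(tie), 0, base_cost, ())]

    while frontier:
        _, _, i, g, chosen = heapq.heappop(frontier)
        if i == n:
            # Every candidate has been decided. Because the bound never
            # overestimates, the first complete state popped is optimal.
            return g, list(chosen)

        name, net, requires = candidates[i]

        # Branch 1: do not materialize this candidate.
        heapq.heappush(
            frontier, (g + suffix_bound[i + 1], next(tie), i + 1, g, chosen)
        )

        # Branch 2: materialize it, if its prerequisite (if any) was chosen.
        if requires is None or requires in chosen:
            g_new = g + net
            heapq.heappush(
                frontier,
                (g_new + suffix_bound[i + 1], next(tie), i + 1, g_new, chosen + (name,)),
            )

    raise RuntimeError("unreachable: the search always reaches a complete state")


# Invented numbers, for illustration only.
example = [
    ("view_V1", 5.0, None),             # costs more to maintain than it saves alone
    ("index_on_V1", -12.0, "view_V1"),  # large saving, but needs view_V1
    ("index_on_R", -3.0, None),
    ("view_V2", 4.0, None),
]

if __name__ == "__main__":
    print(select_structures(100.0, example))
```

In this toy instance the search accepts `view_V1` even though it raises cost on its own, because it enables the larger saving from `index_on_V1`. A realistic cost model would let savings depend on the whole chosen set, which needs a different lower bound.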