Selection of views to materialize in a data warehouse
Top Cited Papers
- 17 January 2005
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering
- Vol. 17 (1) , 24-43
- https://doi.org/10.1109/tkde.2005.16
Abstract
A data warehouse stores materialized views of data from one or more sources, with the purpose of efficiently implementing decision-support or OLAP queries. One of the most important decisions in designing a data warehouse is the selection of materialized views to be maintained at the warehouse. The goal is to select an appropriate set of views that minimizes total query response time and the cost of maintaining the selected views, given a limited amount of resource, e.g., materialization time, storage space, etc. In This work, we have developed a theoretical framework for the general problem of selection of views in a data warehouse. We present polynomial-time heuristics for a selection of views to optimize total query response time under a disk-space constraint, for some important special cases of the general data warehouse scenario, viz.: 1) an AND view graph, where each query/view has a unique evaluation, e.g., when a multiple-query optimizer can be used to general a global evaluation plan for the queries, and 2) an OR view graph, in which any view can be computed from any one of its related views, e.g., data cubes. We present proofs showing that the algorithms are guaranteed to provide a solution that is fairly close to (within a constant factor ratio of) the optimal solution. We extend our heuristic to the general AND-OR view graphs. Finally, we address in detail the view-selection problem under the maintenance cost constraint and present provably competitive heuristics.Keywords
This publication has 16 references indexed in Scilit:
- View selection using randomized searchData & Knowledge Engineering, 2002
- The view-selection problem has an exponential-time lower bound for conjunctive queries and viewsPublished by Association for Computing Machinery (ACM) ,2002
- SPEEDING UP MATERIALIZED VIEW SELECTION IN DATA WAREHOUSES USING A RANDOMIZED ALGORITHMInternational Journal of Cooperative Information Systems, 2001
- On the complexity of the view-selection problemPublished by Association for Computing Machinery (ACM) ,1999
- Genetic Algorithm for Materialized View Selection in Data Warehouse EnvironmentsPublished by Springer Nature ,1999
- Multi-table joins through bitmapped join indicesACM SIGMOD Record, 1995
- Research problems in data warehousingPublished by Association for Computing Machinery (ACM) ,1995
- Multiple query optimization with Depth-First Branch-and-Bound and dynamic query orderingPublished by Association for Computing Machinery (ACM) ,1993
- Multiple-query optimizationACM Transactions on Database Systems, 1988
- The Logical Access Path Schema of a DatabaseIEEE Transactions on Software Engineering, 1982