Query caching and optimization in distributed mediator systems
- 1 June 1996
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMOD Record
- Vol. 25 (2) , 137-146
- https://doi.org/10.1145/235968.233327
Abstract
Query processing and optimization in mediator systems that access distributed non-proprietary sources pose many novel problems. Cost-based query optimization is hard because the mediator does not have access to source statistics information and furthermore it may not be easy to model the source's performance. At the same time, querying remote sources may be very expensive because of high connection overhead, long computation time, financial charges, and temporary unavailability. We propose a cost-based optimization technique that caches statistics of actual calls to the sources and consequently estimates the cost of the possible execution plans based on the statistics cache. We investigate issues pertaining to the design of the statistics cache and experimentally analyze various tradeoffs. We also present a query result caching mechanism that allows us to effectively use results of prior queries when the source is not readily available. We employ the novel invariants mechanism, which shows how semantic information about data sources may be used to discover cached query results of interest.Keywords
This publication has 19 references indexed in Scilit:
- Hybrid knowledge basesIEEE Transactions on Knowledge and Data Engineering, 1996
- The Advanced Video Information System: data structures and query processingMultimedia Systems, 1996
- Amalgamating knowledge basesACM Transactions on Database Systems, 1994
- Function materialization in object bases: design, realization, and evaluationIEEE Transactions on Knowledge and Data Engineering, 1994
- On the logical foundations of schema integration and evolution in heterogeneous database systemsPublished by Springer Nature ,1993
- Memoing for logic programsCommunications of the ACM, 1992
- Updatable views in object-oriented databasesPublished by Springer Nature ,1991
- Federated database systems for managing distributed, heterogeneous, and autonomous databasesACM Computing Surveys, 1990
- Updating derived relations: detecting irrelevant and autonomously computable updatesACM Transactions on Database Systems, 1989
- Maintenance of viewsACM SIGMOD Record, 1984