Partial results for online query processing
- 3 June 2002
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 275-286
- https://doi.org/10.1145/564691.564723
Abstract
Traditional query processors generate full, accurate query results, either in batch or in pipelined fashion. We argue that this strict model is too rigid for exploratory queries over diverse and distributed data sources, such as sources on the Internet. Instead, we propose a looser model of querying in which a user submits a broad initial query outline, and the system continually generates partial result tuples that may contain values for only some of the output fields. The user can watch these partial results accumulate at the user interface, and accordingly refine the query by specifying their interest in different kinds of partial results.

After describing our querying model and user interface, we present a query processing architecture for this model which is implemented in the Telegraph dataflow system. Our architecture is designed to generate partial results quickly, and to adapt query execution to changing user interests. The crux of this architecture is a dataflow operator that supports two kinds of reorderings: reordering of intermediate tuples within a dataflow, and reordering of query plan operators through which tuples flow. We study reordering policies that optimize for the quality of partial results delivered over time, and experimentally demonstrate the benefits of our architecture in this context.
This publication has 12 references indexed in Scilit:
- Online query processing. Published by Association for Computing Machinery (ACM), 2001
- Online dynamic reordering. The VLDB Journal, 2000
- WSQ/DSQ. Published by Association for Computing Machinery (ACM), 2000
- Eddies. Published by Association for Computing Machinery (ACM), 2000
- An adaptive query execution system for data integration. Published by Association for Computing Machinery (ACM), 1999
- Efficient mid-query re-optimization of sub-optimal query execution plans. Published by Association for Computing Machinery (ACM), 1998
- Cost-based query scrambling for initial delays. Published by Association for Computing Machinery (ACM), 1998
- Online aggregation. Published by Association for Computing Machinery (ACM), 1997
- Query processing and optimization in Oracle Rdb. The VLDB Journal, 1996
- Orienteering in an information landscape. Published by Association for Computing Machinery (ACM), 1993