Adaptive ordering of pipelined stream filters
Top Cited Papers
- 13 June 2004
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 407-418
- https://doi.org/10.1145/1007568.1007615
Abstract
We consider the problem of pipelined filters, where a continuous stream of tuples is processed by a set of commutative filters. Pipelined filters are common in stream applications and capture a large class of multiway stream joins. We focus on the problem of ordering the filters adaptively to minimize processing cost in an environment where stream and filter characteristics vary unpredictably over time. Our core algorithm, A-Greedy (for Adaptive Greedy), has strong theoretical guarantees: If stream and filter characteristics were to stabilize, A-Greedy would converge to an ordering within a small constant factor of optimal. (In experiments A-Greedy usually converges to the optimal ordering.) One very important feature of A-Greedy is that it monitors and responds to selectivities that are correlated across filters (i.e., that are nonindependent), which provides the strong quality guarantee but incurs run-time overhead. We identify a three-way tradeoff among provable convergence to good orderings, run-time overhead, and speed of adaptivity. We develop a suite of variants of A-Greedy that lie at different points on this tradeoff spectrum. We have implemented all our algorithms in the STREAM prototype Data Stream Management System and a thorough performance evaluation is presented.Keywords
This publication has 17 references indexed in Scilit:
- An initial study of overheads of eddiesACM SIGMOD Record, 2004
- Stream window join: tracking moving objects in sensor-network databasesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- GigascopePublished by Association for Computing Machinery (ACM) ,2003
- Continuously adaptive continuous queries over streamsPublished by Association for Computing Machinery (ACM) ,2002
- Conjunctive selection conditions in main memoryPublished by Association for Computing Machinery (ACM) ,2002
- An adaptive query execution system for data integrationPublished by Association for Computing Machinery (ACM) ,1999
- Optimization of queries with user-defined predicatesACM Transactions on Database Systems, 1999
- Optimization techniques for queries with expensive methodsACM Transactions on Database Systems, 1998
- Cost-based query scrambling for initial delaysPublished by Association for Computing Machinery (ACM) ,1998
- Implications of certain assumptions in database performance evauationACM Transactions on Database Systems, 1984