Online aggregation
- 1 June 1997
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMOD Record
- Vol. 26 (2) , 171-182
- https://doi.org/10.1145/253262.253291
Abstract
Aggregation in traditional database systems is performed in batch mode: a query is submitted, the system processes a large volume of data over a long period of time, and, eventually, the final answer is returned. This archaic approach is frustrating to users and has been abandoned in most other areas of computing. In this paper we propose a new online aggregation interface that permits users to both observe the progress of their aggregation queries and control execution on the fly. After outlining usability and performance requirements for a system supporting online aggregation, we present a suite of techniques that extend a database system to meet these requirements. These include methods for returning the output in random order, for providing control over the relative rate at which different aggregates are computed, and for computing running confidence intervals. Finally, we report on an initial implementation of online aggregation in POSTGRES.Keywords
This publication has 17 references indexed in Scilit:
- Query processing and optimization in Oracle RdbThe VLDB Journal, 1996
- Selectivity and Cost Estimation for Joins Based on Random SamplingJournal of Computer and System Sciences, 1996
- Processing queries for first-few answersPublished by Association for Computing Machinery (ACM) ,1996
- Efficient sampling strategies for relational database operationsTheoretical Computer Science, 1993
- APPROXIMATE-a query processor that produces monotonically improving approximate answersIEEE Transactions on Knowledge and Data Engineering, 1993
- Multiresolution coding techniques for digital television: A reviewMultidimensional Systems and Signal Processing, 1992
- Implementation techniques for main memory database systemsPublished by Association for Computing Machinery (ACM) ,1984
- Algorithms for Computing the Sample Variance: Analysis and RecommendationsThe American Statistician, 1983
- Access path selection in a relational database management systemPublished by Association for Computing Machinery (ACM) ,1979
- Probability Inequalities for Sums of Bounded Random VariablesJournal of the American Statistical Association, 1963