Approximate aggregation techniques for sensor databases
Top Cited Papers
- 28 September 2004
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
In the emerging area of sensor-based systems, a sig- nificant challenge is to develop scalable, fault-tolerant methods to extract useful information from the data the sensors collect. An approach to this data management problem is the use of sensor database systems, exempli- fied by TinyDB and Cougar, which allow users to per- form aggregation queries such as MIN, COUNT and AVG on a sensor network. Due to power and range con- straints, centralized approaches are generally impracti- cal, so most systems use in-network aggregation to re- duce network traffic. However, these aggregation strate- gies become bandwidth-intensive when combined with the fault-tolerant, multi-path routing methods often used in these environments. For example, duplicate-sensitive ag- gregates such as SUM cannot be computed exactly us- ing substantially less bandwidth than explicit enumera- tion. To avoid this expense, we investigate the use of ap- proximate in-network aggregation using small sketches. Our contributions are as follows: 1) we generalize well known duplicate-insensitive sketches for approximating COUNT to handle SUM, 2) we present and analyze meth- ods for using sketches to produce accurate results with lowcommunication and computation overhead, and 3) we present an extensive experimental validation of our methods.Keywords
This publication has 18 references indexed in Scilit:
- GRAdient Broadcast: A Robust Data Delivery Protocol for Large Scale Sensor NetworksWireless Networks, 2005
- Computing aggregates for monitoring wireless sensor networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- The design of an acquisitional query processor for sensor networksPublished by Association for Computing Machinery (ACM) ,2003
- The cougar approach to in-network query processing in sensor networksACM SIGMOD Record, 2002
- ANFPublished by Association for Computing Machinery (ACM) ,2002
- TAGPublished by Association for Computing Machinery (ACM) ,2002
- Directed diffusionPublished by Association for Computing Machinery (ACM) ,2000
- Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-TotalsData Mining and Knowledge Discovery, 1997
- Binomial random variate generationCommunications of the ACM, 1988
- An Efficient Method for Generating Discrete Random Variables with General DistributionsACM Transactions on Mathematical Software, 1977