Exploiting punctuation semantics in continuous data streams
Top Cited Papers
- 13 May 2003
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering
- Vol. 15 (3) , 555-568
- https://doi.org/10.1109/tkde.2003.1198390
Abstract
As most current query processing architectures are already pipelined, it seems logical to apply them to data streams. However, two classes of query operators are impractical for processing long or infinite data streams. Unbounded stateful operators maintain state with no upper bound in size and, so, run out of memory. Blocking operators read an entire input before emitting a single output and, so, might never produce a result. We believe that a priori knowledge of a data stream can permit the use of such operators in some cases. We discuss a kind of stream semantics called punctuated streams. Punctuations in a stream mark the end of substreams allowing us to view an infinite stream as a mixture of finite streams. We introduce three kinds of invariants to specify the proper behavior of operators in the presence of punctuation. Pass invariants define when results can be passed on. Keep invariants define what must be kept in local state to continue successful operation. Propagation invariants define when punctuation can be passed on. We report on our initial implementation and show a strategy for proving implementations of these invariants are faithful to their relational counterparts.Keywords
This publication has 17 references indexed in Scilit:
- Fjording the stream: an architecture for queries over streaming sensor dataPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Query processing of streamed XML dataPublished by Association for Computing Machinery (ACM) ,2002
- Continuously adaptive continuous queries over streamsPublished by Association for Computing Machinery (ACM) ,2002
- Continuous queries over data streamsACM SIGMOD Record, 2001
- On computing correlated aggregates over continual data streamsACM SIGMOD Record, 2001
- HancockPublished by Association for Computing Machinery (ACM) ,2000
- Online aggregationACM SIGMOD Record, 1997
- Sequence query processingPublished by Association for Computing Machinery (ACM) ,1994
- Query evaluation techniques for large databasesACM Computing Surveys, 1993
- ASPEN: A stream processing environmentPublished by Springer Nature ,1989