STREAM: The Stanford Data Stream Management System

Arasu, Arvind; Babcock, Brian; Babu, Shivnath; Cieslewicz, John; Datar, Mayur; Ito, Keith; Motwani, Rajeev; Srivastava, Utkarsh; Widom, Jennifer

doi:10.1007/978-3-540-28608-0_16

Arvind Arasu⁶,
Brian Babcock⁶,
Shivnath Babu⁶,
John Cieslewicz⁶,
Mayur Datar⁶,
Keith Ito⁶,
Rajeev Motwani⁶,
Utkarsh Srivastava⁶ &
…
Jennifer Widom⁶

Part of the book series: Data-Centric Systems and Applications ((DCSA))

3658 Accesses
61 Citations
3 Altmetric

Abstract

Traditional database management systems are best equipped to run one-time queries over finite stored data sets. However, many modern applications such as network monitoring, financial analysis, manufacturing, and sensor networks require long-running, or continuous, queries over continuous unbounded streams of data. In the STREAM project at Stanford, we are investigating data management and query processing for this class of applications. As part of the project we are building a general-purpose prototype Data Stream Management System (DSMS), also called STREAM, that supports a large class of declarative continuous queries over continuous streams and traditional stored data sets. The STREAM prototype targets environments where streams may be rapid, stream characteristics and query loads may vary over time, and system resources may be limited.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 69.99; Price excludes VAT (USA)

Hardcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

M.K. Aguilera, R.E. Strom, D.C. Sturman, M. Astley, T.D. Chandra, Matching events in a content-based subscription system, in Proc. of the 18th Annual ACM Symp. on Principles of Distributed Computing (1999), pp. 53–61
Google Scholar
A. Arasu, B. Babcock, S. Babu, J. McAlister, J. Widom, Characterizing memory requirements for queries over continuous data streams. ACM Trans. Database Syst. 29(1), 1–33 (2004)
Article Google Scholar
A. Arasu, S. Babu, J. Widom, The CQL continuous query language: semantic foundations and query execution. VLDB J. 15(2), 121–142 (2006)
Article Google Scholar
B. Babcock, S. Babu, M. Datar, R. Motwani, Chain: operator scheduling for memory minimization in data stream systems, in Proc. of the 2003 ACM SIGMOD Intl. Conf. on Management of Data (2003), pp. 253–264
Chapter Google Scholar
B. Babcock, S. Babu, M. Datar, R. Motwani, J. Widom, Models and issues in data stream systems, in Proc. of the 21st ACM SIGACT–SIGMOD–SIGART Symp. on Principles of Database Systems (2002), pp. 1–16
Google Scholar
B. Babcock, M. Datar, R. Motwani, Load shedding for aggregation queries over data streams, in Proc. of the 20th Intl. Conf. on Data Engineering (2004)
Google Scholar
S. Babu, R. Motwani, K. Munagala, I. Nishizawa, J. Widom, Adaptive ordering of pipelined stream filters, in Proc. of the 2004 ACM SIGMOD Intl. Conf. on Management of Data (2004)
Google Scholar
S. Babu, K. Munagala, J. Widom, R. Motwani, Adaptive caching for continuous queries, in Proc. of the 21st Intl. Conf. on Data Engineering (2005), pp. 118–129
Google Scholar
S. Babu, U. Srivastava, J. Widom, Exploiting \(k\)-constraints to reduce memory overhead in continuous queries over data streams. ACM Trans. Database Syst. 29(3), 545–580 (2004)
Article Google Scholar
S. Babu, J. Widom, StreaMon: an adaptive engine for stream query processing, in Proc. of the 2004 ACM SIGMOD Intl. Conf. on Management of Data (2004). Demonstration description
Google Scholar
B.H. Bloom, Space/time trade-offs in hash coding with allowable errors. Commun. ACM 13(7), 422–426 (1970)
Article MATH Google Scholar
K. Chakrabarti, M.N. Garofalakis, R. Rastogi, K. Shim, Approximate query processing using wavelets, in Proc. of the 26th Intl. Conf. on Very Large Data Bases (2000), pp. 111–122
Google Scholar
J. Gehrke (ed.), Data stream processing. IEEE Comput. Soc. Bull. Technical Comm. Database Eng. 26(1) (2003)
Google Scholar
F. Fabret, H.-.A. Jacobsen, F. Llirbat, J. Pereira, K.A. Ross, D. Shasha, Filtering algorithms and implementation for very fast publish/subscribe, in Proc. of the 2000 ACM SIGMOD Intl. Conf. on Management of Data (2001), pp. 115–126
Google Scholar
R.E. Gruber, B. Krishnamurthy, E. Panagos, READY: a high performance event notification system, in Proc. of the 16th Intl. Conf. on Data Engineering (2000), pp. 668–669
Google Scholar
W. Hoeffding, Probability inequalities for sums of bounded random variables. J. Am. Stat. Soc. 58(301), 13–30 (1963)
Article MathSciNet MATH Google Scholar
U. Srivastava, J. Widom, Flexible time management in data stream systems, in Proc. of the 23rd ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems (2004)
Google Scholar
N. Tatbul, U. Cetintemel, S.B. Zdonik, M. Cherniak, M. Stonebraker, Load shedding in a data stream manager, in Proc. of the 29th Intl. Conf. on Very Large Data Bases (2003), pp. 309–320
Google Scholar
N. Thaper, S. Guha, P. Indyk, N. Koudas, Dynamic multidimensional histograms, in Proc. of the 2002 ACM SIGMOD Intl. Conf. on Management of Data (2002), pp. 428–439
Chapter Google Scholar
D. Thomas, R. Motwani, Caching queues in memory buffers, in Proc. of the 15th Annual ACM–SIAM Symp. on Discrete Algorithms (2004)
Google Scholar
P.A. Tucker, D. Maier, T. Sheard, L. Fegaras, Exploiting punctuation semantics in continuous data streams. IEEE Trans. Knowl. Data Eng. 15(3), 555–568 (2003)
Article Google Scholar
S. Viglas, J.F. Naughton, J. Burger, Maximizing the output rate of multi-way join queries over streaming information sources, in Proc. of the 29th Intl. Conf. on Very Large Data Bases (2003), pp. 285–296
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Stanford University, Stanford, CA, USA
Arvind Arasu, Brian Babcock, Shivnath Babu, John Cieslewicz, Mayur Datar, Keith Ito, Rajeev Motwani, Utkarsh Srivastava & Jennifer Widom

Authors

Arvind Arasu
View author publications
You can also search for this author in PubMed Google Scholar
Brian Babcock
View author publications
You can also search for this author in PubMed Google Scholar
Shivnath Babu
View author publications
You can also search for this author in PubMed Google Scholar
John Cieslewicz
View author publications
You can also search for this author in PubMed Google Scholar
Mayur Datar
View author publications
You can also search for this author in PubMed Google Scholar
Keith Ito
View author publications
You can also search for this author in PubMed Google Scholar
Rajeev Motwani
View author publications
You can also search for this author in PubMed Google Scholar
Utkarsh Srivastava
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Widom
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jennifer Widom .

Editor information

Editors and Affiliations

University Campus - Kounoupidiana, School of ECE, Techn. Univ. of Crete University Campus - Kounoupidiana, Chania, Greece
Minos Garofalakis
Microsoft Corporation, Redmond, Washington, USA
Johannes Gehrke
Amazon India , Bangalore, India
Rajeev Rastogi

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Arasu, A. et al. (2016). STREAM: The Stanford Data Stream Management System. In: Garofalakis, M., Gehrke, J., Rastogi, R. (eds) Data Stream Management. Data-Centric Systems and Applications. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28608-0_16

Download citation

DOI: https://doi.org/10.1007/978-3-540-28608-0_16
Published: 12 July 2016
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28607-3
Online ISBN: 978-3-540-28608-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics