Skip to main content

Distributed Data Streams

  • Reference work entry
  • First Online:
Book cover Encyclopedia of Database Systems
  • 22 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 4,499.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 6,499.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  1. Alon N, Matias Y, Szegedy M. The space complexity of approximating the frequency moments. In: Proceedings of the 28th Annual ACM Symposium on the Theory of Computing; 1996. p. 20–9.

    Google Scholar 

  2. Babcock B, Olston C. Distributed top-K monitoring. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data; 2003.

    Google Scholar 

  3. Balazinska M, Balakrishnan H, Madden S, Stonebraker M. Fault-tolerance in the borealis distributed stream processing system. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2005.

    Google Scholar 

  4. Chu D, Deshpande A, Hellerstein JM, Hong W. Approximate data collection in sensor networks using probabilistic models. In: Proceedings of the 22nd International Conference on Data Engineering; 2006.

    Google Scholar 

  5. Cormode G, Garofalakis M. Sketching streams through the net: distributed approximate query tracking. In: Proceedings of the 31st International Conference on Very Large Data Bases; 2005.

    Google Scholar 

  6. Cormode G, Muthukrishnan S, Yi K. Algorithms for distributed functional monitoring. In: Proceedings of the 19th Annual ACM-SIAM Symposium on Discrete Algorithms; 2008.

    Google Scholar 

  7. Cranor C, Johnson T, Spatscheck O, Shkapenyuk V. Gigascope: a stream database for network applications. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2003.

    Google Scholar 

  8. Das A, Ganguly S, Garofalakis M, Rastogi R. Distributed set-expression cardinality estimation. In: Proceedings of the 30th International Conference on Very Large Data Bases; 2004.

    Chapter  Google Scholar 

  9. Flajolet P, Nigel Martin G. Probabilistic counting algorithms for data base applications. J Comput Syst Sci. 1985;31(2):182–209.

    Article  MathSciNet  MATH  Google Scholar 

  10. Garofalakis M, Hellerstein JM, Maniatis P. Proof sketches: verifiable in-network aggregation. In: Proceedings of the 23rd International Conference on Data Engineering; 2007.

    Google Scholar 

  11. Guestrin C, Bodik P, Thibaux R, Paskin M, Madden S. Distributed regression: an efficient framework for modeling sensor network data. Inform. Process. Sensor Networks; 2004.

    Google Scholar 

  12. Huang L, Nguyen X, Garofalakis M, Hellerstein JM, Jordan MI, Joseph AD, Taft N. Communication-efficient online detection of network-wide anomalies. In: Proceedings of the 26th Annual Joint Conference of the IEEE Computer and Communications Societies; 2007.

    Google Scholar 

  13. Jain A, Hellerstein J, Ratnasamy S, Wetherall D. A wakeup call for internet monitoring systems: The case for distributed triggers. In: Proceedings of the Third Workshop on Hot Topics in Networks; 2004.

    Google Scholar 

  14. Jain S, Fall K, Patra R. Routing in a delay tolerant network. In: Proceedings of the ACM International Conference of the on Data Communication; 2005.

    Google Scholar 

  15. Kempe D, Dobra A, Gehrke J. Gossip-based computation of aggregate information. In: Proceedings of the 44th Annual IEEE Symposium on Foundations of Computer Science; 2003.

    Google Scholar 

  16. Keralapura R, Cormode G, Ramamirtham J. Communication-efficient distributed monitoring of thresholded counts. In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data; 2006, p. 289–300.

    Google Scholar 

  17. Loo BT, Condie T, Garofalakis M, Gay DE, Hellerstein JM, Maniatis P, Ramakrishnan R, Roscoe T, Stoica I. Declarative networking: language, execution, and optimization. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2006.

    Google Scholar 

  18. Madden S, Franklin MJ, Hellerstein JM, Hong W. TAG: a tiny aggregation service for ad-hoc sensor networks. In: Proceedings of the 5th USENIX Symposium on Operating System Design and Implementation; 2002.

    Google Scholar 

  19. Manjhi A, Nath S, Gibbons P. Tributaries and deltas: efficient and robust aggregation in sensor network streams. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2005.

    Google Scholar 

  20. Nath S, Gibbons PB, Seshan S, Anderson ZR. Synopsis diffusion for robust aggrgation in sensor networks. In: Proceedings of the 2nd International Conference on Embedded Networked Sensor Systems; 2004.

    Google Scholar 

  21. Olston C, Jiang J, Widom J Adaptive filters for continuous queries over distributed data streams. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2003.

    Google Scholar 

  22. Pietzuch P, Ledlie J, Schneidman J, Roussopoulos M, Welsh M, Seltzer M. Network-aware operator placement for stream-processing systems. In: Proceedings of the 22nd International Conference on Data Engineering; 2006.

    Google Scholar 

  23. Rhea S, Godfrey B, Karp B, Kubiatowicz J, Ratnasamy S, Shenker S, Stoica I, Yu HY. OpenDHT: a public dht service and its uses. In: Proceedings of the ACM International Conference of the on Data Communication; 2005.

    Google Scholar 

  24. Rissanen J. Modeling by shortest data description. Automatica. 1978;14(5):465–71.

    Article  MATH  Google Scholar 

  25. Shah MA, Hellerstein JM, Brewer E. Highly available, fault-tolerant, parallel dataflows. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2004.

    Google Scholar 

  26. Sharfman I, Schuster A, Keren D. A geometric approach to monitoring threshold functions over distributed data streams. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2006, p. 301–12.

    Google Scholar 

  27. Xing Y, Hwang JH, Cetintemel U, Zdonik S. Providing resiliency to load variations in ditributed stream processing. In: Proceedings of the 32nd International Conference on Very Large Data Bases; 2006.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Minos Garofalakis .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media, LLC, part of Springer Nature

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Garofalakis, M. (2018). Distributed Data Streams. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_137

Download citation

Publish with us

Policies and ethics