Skip to main content

Distributed Sequence Pattern Detection Over Multiple Data Streams

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9282))

Abstract

Sequence pattern detection over streaming data has many real world applications. Most of the present work is aimed to process sequence queries over single data stream. Situations where streaming data arrive from multiple sources have not been explored much. In traditional approaches a single centralized machine handles and processes sequence queries over multiple data streams. While running sequence queries on a single server, even though many of the events in data streams do not lead to successful pattern detection they are still handled and processed by the server. This consumes precious network bandwidth, server’s computing resources and precious time. In this paper we focus on sequence pattern detection, where patterns are defined on chains of events that arrive from multiple distributed data streams. We propose a three layer distributed framework to avoid unnecessary event processing by the server, and to efficiently process sequence queries to detect sequence patterns relying upon chains of events. The bottom layer of data sources sends continuous data streams to the middle layer, which then performs pattern detection locally, and on the basis of the feedback received from the top layer of global server, sends events to the global server to detect complete patterns. Our present work is aimed to detect sequence patterns over multiple data streams, but, our proposed model can be extended to many other areas of distributed stream processing.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Lu, H., Zhou, Y., Haustad, J.: Continuous skyline monitoring over distributed data streams. In: Gertz, M., Ludäscher, B. (eds.) SSDBM 2010. LNCS, vol. 6187, pp. 565–583. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  2. Brenna, L., Gehrke, J., Hong, M., Johansen, D.: Distributed event stream processing with non-deterministic finite automata. In: Proceedings of the Third ACM International Conference on Distributed Event-Based Systems, p. 3. ACM (2009)

    Google Scholar 

  3. Golab, L., Özsu, M.T.: Issues in data stream management. ACM Sigmod Rec. 32(2), 5–14 (2003)

    Article  Google Scholar 

  4. Liu, M., Li, M., Golovnya, D., Rundensteiner, E.A., Claypool, K.: Sequence pattern query processing over out-of-order event streams. In: 2009 IEEE 25th International Conference on Data Engineering. ICDE 2009, pp. 784–795. IEEE (2009)

    Google Scholar 

  5. Stonebraker, M., Çetintemel, U., Zdonik, S.: The 8 requirements of real-time stream processing. ACM SIGMOD Rec. 34(4), 42–47 (2005)

    Article  Google Scholar 

  6. Kawashima, H., Kitagawa, H., Li, X.: Complex event processing over uncertain data streams. In: 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), pp. 521–526. IEEE (2010)

    Google Scholar 

  7. Ramakrishnan, R., Cheng, M., Livny, M., Seshadri, P.: What’s next? sequence queries. In: Proceedings of International Conference Management of Data. Citeseer (1994)

    Google Scholar 

  8. Wang, Y., Cao, K., Zhang, X.: Complex event processing over distributed probabilistic event streams. Comput. Math. Appl. 66(10), 1808–1821 (2013)

    Article  MATH  Google Scholar 

  9. Jiang, Q., Chakravarthy, S.: Scheduling strategies for processing continuous queries over streams. In: Williams, H., MacKinnon, L.M. (eds.) BNCOD 2004. LNCS, vol. 3112, pp. 16–30. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  10. Sharaf, M.A., Labrinidis, A., Chrysanthis, P.K.: Scheduling continuous queries in data stream management systems. Proc. VLDB Endowment 1(2), 1526–1527 (2008)

    Article  Google Scholar 

  11. Mani, M.: Efficient event stream processing: handling ambiguous events and patterns with negation. In: Xu, J., Yu, G., Zhou, S., Unland, R. (eds.) DASFAA Workshops 2011. LNCS, vol. 6637, pp. 415–426. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  12. Schultz-Møller, N.P., Migliavacca, M., Pietzuch, P.: Distributed complex event processing with query rewriting. In: Proceedings of the Third ACM International Conference on Distributed Event-Based Systems, p. 4. ACM (2009)

    Google Scholar 

  13. Agrawal, J., Diao, Y., Gyllstrom, D., Immerman, N.: Efficient pattern matching over event streams. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp. 147–160. ACM (2008)

    Google Scholar 

  14. Babcock, B., Babu, S., Datar, M., Motwani, R., Thomas, D.: Operator scheduling in data stream systems. VLDB J. Int. J. Very Large Data Bases 13(4), 333–353 (2004)

    Article  Google Scholar 

  15. Seshadri, P., Livny, M., Ramakrishnan, R.: Sequence query processing. In: ACM SIGMOD Record, vol. 23, pp. 430–441. ACM (1994)

    Google Scholar 

  16. Wu, J., Tan, K.-L., Zhou, Y.: QoS-oriented multi-query scheduling over data streams. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds.) DASFAA 2009. LNCS, vol. 5463, pp. 215–229. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  17. Diao, Y., Immerman, N., Gyllstrom, D.: Sase+: An Agile Language for Kleene Closure Over Event Streams. ACM Press, New York (2007)

    Google Scholar 

  18. Law, Y.N., Wang, H., Zaniolo, C.: Query languages and data models for database sequences and data streams. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases-Volume 30, VLDB Endowment, pp. 492–503 (2004)

    Google Scholar 

  19. Wu, E., Diao, Y., Rizvi, S.: High-performance complex event processing over streams. In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pp. 407–418. ACM (2006)

    Google Scholar 

  20. Sadoghi, M., Singh, H., Jacobsen, H.A.: Towards highly parallel event processing through reconfigurable hardware. In: Proceedings of the Seventh International Workshop on Data Management on New Hardware, pp. 27–32. ACM (2011)

    Google Scholar 

  21. Mei, Y., Madden, S.: Zstream: a cost-based query processor for adaptively detecting composite events. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, pp. 193–206. ACM (2009)

    Google Scholar 

  22. Balkesen, C., Dindar, N., Wetter, M., Tatbul, N.: Rip: Run-based intra-query parallelism for scalable complex event processing. In: Proceedings of the 7th ACM International Conference on Distributed Event-Based Systems, pp. 3–14. ACM (2013)

    Google Scholar 

  23. Hirzel, M.: Partition and compose: Parallel complex event processing. In: Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems, pp. 191–200. ACM (2012)

    Google Scholar 

  24. Zhou, Y., Ma, C., Guo, Q., Shou, L., Chen, G.: Sequence pattern matching over time-series data with temporal uncertainty. In: EDBT, pp. 205–216 (2014)

    Google Scholar 

  25. Leghari, A.K., Wolf, M., Zhou, Y.: Efficient pattern detection over a distributed framework. In: Castellanos, M., Dayal, U., Pedersen, T.B., Tatbul, N. (eds.) BIRTE 2013 and 2014. LNBIP, vol. 206, pp. 133–149. Springer, Heidelberg (2015)

    Google Scholar 

  26. Wu, J., Zhou, Y., Aberer, K., Tan, K.L.: Towards integrated and efficient scientific sensor data processing: a database approach. In: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, pp. 922–933. ACM (2009)

    Google Scholar 

  27. http://www.infochimps.com/. 03 December 2014

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ahmed Khan Leghari .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Leghari, A.K., Cao, J., Zhou, Y. (2015). Distributed Sequence Pattern Detection Over Multiple Data Streams. In: Tadeusz, M., Valduriez, P., Bellatreche, L. (eds) Advances in Databases and Information Systems. ADBIS 2015. Lecture Notes in Computer Science(), vol 9282. Springer, Cham. https://doi.org/10.1007/978-3-319-23135-8_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23135-8_26

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23134-1

  • Online ISBN: 978-3-319-23135-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics