Evaluation and Development Perspectives of Stream Data Processing Systems

  • Conference paper
Computer Networks (CN 2013)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 370)

Abstract

This paper describes some common aspects of stream data processing systems. It consists of two main parts. The first gives a short description, test results, and conclusions for an implemented system, AGKPStream. The second focuses on proposed solutions that draw on experience gained while developing that system, as well as on knowledge acquired while studying some concepts of the StreamAPAS system. The first issue discussed is tuple construction, the basic data representation; it covers the tuple time model, the tuple schema, and the tuple decorator. The stream query and scheduling problems are described afterwards.

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gorawski, M., Gorawska, A., Pasterak, K. (2013). Evaluation and Development Perspectives of Stream Data Processing Systems. In: Kwiecień, A., Gaj, P., Stera, P. (eds) Computer Networks. CN 2013. Communications in Computer and Information Science, vol 370. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38865-1_31

  • DOI: https://doi.org/10.1007/978-3-642-38865-1_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38864-4

  • Online ISBN: 978-3-642-38865-1

  • eBook Packages: Computer Science (R0)
