Skip to main content

Data Stream Operations as First-Class Entities in Component-Based Performance Models

  • Conference paper
  • First Online:
Software Architecture (ECSA 2020)

Abstract

Data streaming applications are an important class of data-intensive systems. Performance is an essential quality of such systems. It is, for example, expressed by the delay of analysis results or the utilization of system resources. Architecture-level decisions such as the configuration of sources, sinks and operations, their deployment or the choice of technology impact the performance. Current component-based performance prediction approaches cannot accurately predict the performance of those systems, because they do not support the metrics that are specific to data streaming applications and only approximate the behavior of data stream operations instead of expressing it explicitly. In particular, operations that group multiple data events and thus introduce timing dependencies between different calls to the system are not represented sufficiently. In this paper, we present an approach for modeling networks of data stream operations including their parameters with the goal of predicting the performance of the resulting composed data streaming application. The approach is based on a component-based performance model with queueing semantics for processing resources. Our evaluation shows that our model can more accurately express the behavior of the system, resulting in a more expressive performance model compared to a well-encapsulated component-based model without data stream operations.

Partially funded by the German Research Foundation (DFG) as part of the Research Training Group GRK 2153: “Energy Status Data – Informatics Methods for its Collection, Analysis and Exploitation”.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The data is available publicly via the website of the challenge [9].

References

  1. Aliabadi, S.K., et al.: Analytical composite performance models for big data applications. J. Netw. Comput. Appl. 142, 63–75 (2019)

    Article  Google Scholar 

  2. Arasu, A., Babu, S., Widom, J.: The CQL continuous query language: semantic foundations and query execution. VLDB J. 15(2), 121–142 (2006)

    Article  Google Scholar 

  3. Casale, G., Li, C.: Enhancing big data application design with the DICE framework. In: Mann, Z.Á., Stolz, V. (eds.) ESOCC 2017. CCIS, vol. 824, pp. 164–168. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-79090-9_13

    Chapter  Google Scholar 

  4. Castiglione, A., et al.: Modeling performances of concurrent big data applications. Softw. Pract. Exper. 45(8), 1127–1144 (2015)

    Article  Google Scholar 

  5. DICE consortium: Deliverable 2.4 DICE Deployment Abstractions, European Union’s Horizon 2020 programme (2017). http://www.dice-h2020.eu/deliverables/

  6. DICE consortium: Deliverable 3.4 DICE simulation tools, European Union’s Horizon 2020 programme (2017). http://www.dice-h2020.eu/deliverables/

  7. Happe, L., Buhnova, B., Reussner, R.: Stateful component-based performance models. Softw. Syst. Model. 13(4), 1319–1343 (2013). https://doi.org/10.1007/s10270-013-0336-6

    Article  Google Scholar 

  8. Hummel, O., et al.: A collection of software engineering challenges for big data system development. In: Euromicro SEAA, pp. 362–369. IEEE (2018)

    Google Scholar 

  9. Jerzak, Z., Ziekow, H.: DEBS 2014 grand challenge: smart homes - DEBS.org. https://debs.org/grand-challenges/2014/

  10. Jerzak, Z., Ziekow, H.: The DEBS 2014 grand challenge. In: DEBS 2014, pp. 266–269. ACM (2014)

    Google Scholar 

  11. Kroß, J., Krcmar, H.: Model-based performance evaluation of batch and stream applications for big data. In: MASCOTS, pp. 80–86. IEEE (2017)

    Google Scholar 

  12. Kroß, J., Krcmar, H.: PerTract: model extraction and specification of big data systems for performance prediction by the example of apache spark and hadoop. Big Data Cogn. Comput. 3(3), 47 (2019)

    Article  Google Scholar 

  13. Maddodi, G., Jansen, S., Overeem, M.: Aggregate architecture simulation in event-sourcing applications using layered queuing networks. In: ICPE 2020, pp. 238–245. ACM (2020)

    Google Scholar 

  14. Mazkatli, M., et al.: Incremental calibration of architectural performance models with parametric dependencies. In: ICSA 2020. IEEE (2020)

    Google Scholar 

  15. Meijer, E.: Your mouse is a database. ACM Queue 10(3), 20 (2012)

    Article  Google Scholar 

  16. Rathfelder, C.: Modelling event-based interactions in component-based architectures for quantitative system evaluation. In: The Karlsruhe Series on Software Design and Quality, KIT Scientific Publishing (2013)

    Google Scholar 

  17. Reussner, R.H., et al.: Modeling and Simulating Software Architectures - The Palladio Approach. MIT Press, Cambridge (2016)

    Google Scholar 

  18. Sachs, K.: Performance modeling and benchmarking of event-based systems. Ph.D. thesis, Darmstadt University of Technology (2011)

    Google Scholar 

  19. Werle, D.: GitHub repository of palladio indirections. https://github.com/PalladioSimulator/Palladio-Addons-Indirections

  20. Werle, D.: Data Stream Operations as First-Class Entities in Component-Based Performance Models - Auxiliary Material (2020). https://doi.org/10.5281/zenodo.3937718

    Article  Google Scholar 

  21. Werle, D., Seifermann, S., Koziolek, A.: Data stream operations as first-class entities in palladio. In: SSP 2019. Softwaretechnik Trends (2019)

    Google Scholar 

  22. Wu, E., Diao, Y., Rizvi, S.: High-performance complex event processing over streams. In: SIGMOD, pp. 407–418. ACM (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dominik Werle .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Werle, D., Seifermann, S., Koziolek, A. (2020). Data Stream Operations as First-Class Entities in Component-Based Performance Models. In: Jansen, A., Malavolta, I., Muccini, H., Ozkaya, I., Zimmermann, O. (eds) Software Architecture. ECSA 2020. Lecture Notes in Computer Science(), vol 12292. Springer, Cham. https://doi.org/10.1007/978-3-030-58923-3_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-58923-3_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-58922-6

  • Online ISBN: 978-3-030-58923-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics