Skip to main content

Scalable Online Analytics on Cloud Infrastructures

  • Conference paper
  • First Online:
Advances in Computing and Data Sciences (ICACDS 2016)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 721))

Included in the following conference series:

  • 1583 Accesses

Abstract

The need for low latency analysis of high velocity real time continuous data streams has led to the emergence of Stream Processing Systems (SPSs). Contemporary SPSs allow a stream processing application to be hosted on Cloud infrastructures and dynamically scaled so as to adapt to the fluctuating data rates. However, the run time scalability incorporated in these SPSs are in their early adaptations and are based on simple local/global threshold based controls. This work studies the issues with the local and global auto scaling techniques that may lead to performance inefficiencies in real time traffic analysis on Cloud platforms and presents an efficient hybrid auto scaling strategy StreamScale which addresses the identified issues. The proposed StreamScale auto-scaling algorithm accounts for the gaps in the local/global scaling approaches and effectively identifies (de)parallelization opportunities in stream processing applications for maintaining QoS at reduced costs. Simulation based experimental evaluation on representative stream application topologies indicate that the proposed StreamScale auto-scaling algorithm exhibits better performance in comparison to both local and global auto-scaling approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Abadi, D.J., Ahmad, Y., Balazinska, M., Çetintemel, U., Cherniack, M., Hwang, J.-H., Lindner, W., Maskey, A., Rasin, A., Ryvkina, E., Tatbul, N., Xing, Y. and Zdonik, S.B.: The design of the Borealis stream processing engine. In: CIDR, pp. 277–289 (2005)

    Google Scholar 

  2. Arasu, A., Babcock, B., Babu, S., Datar, M., Ito, K., Motwani, R., Nishizawa, I., Srivastava, U., Thomas, D., Varma, R., Widom, J.: STREAM: the stanford stream data manager. IEEE Data Eng. Bull. 26, 19–26 (2003)

    Google Scholar 

  3. Biem, A., Bouillet, E., Feng, H.: IBM infosphere streams for scalable, real-time, intelligent transportation services. In: SIGMOD 2010: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp. 1093–1103 (2010)

    Google Scholar 

  4. Toshniwal, A., Donham, J., Bhagat, N., Mittal, S., Ryaboy, D., Taneja, S., Shukla, A., Ramasamy, K., Patel, J.M., Kulkarni, S., Jackson, J., Gade, K., Fu, M.: Storm@twitter. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data - SIGMOD 2014, pp. 147–156 (2014)

    Google Scholar 

  5. Google Compute Engine. https://developers.google.com/compute/pricing

  6. Gulisano, V., Jiménez-Peris, R., Patiño-Mart́nez, M., Soriente, C., Valduriez, P.: StreamCloud: an elastic and scalable data streaming system. IEEE Trans. Parallel Distrib. Syst. 23(12), 2351–2365 (2012)

    Article  Google Scholar 

  7. Iosup, A., Yigitbasi, N., Epema, D.: On the performance variability of production cloud services. In: Proceedings - 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2011, pp. 104–113 (2011)

    Google Scholar 

  8. Jain, N., Amini, L., Andrade, H., King, R.: Design, implementation, and evaluation of the linear road benchmark on the stream processing core. In: Proceedings, pp. 431–442 (2006)

    Google Scholar 

  9. Kumbhare, A.G., Member, S., Simmhan, Y., Member, S.: Reactive resource provisioning heuristics for dynamic dataflows on cloud infrastructure. IEEE Trans. Cloud Comput. 3(2), 105–118 (2015)

    Article  Google Scholar 

  10. Satzger, B., Hummer, W., Leitner, P., Dustdar, S.: Esc: towards an elastic stream computing platform for the cloud. In: Proceedings - 2011 IEEE 4th International Conference on Cloud Computing, CLOUD 2011, pp. 348–355 (2011)

    Google Scholar 

  11. Schad, J., Dittrich, J., Quiané-Ruiz, J.-A.: Runtime measurements in the cloud observing analyzing and reducing variance. Proc. VLDB Endow. 3(1–2), 460–471 (2010)

    Article  Google Scholar 

  12. StreamBase Complex Event Processing – Overview—TIBCO: 2013. http://www.tibco.com/products/event-processing/complex-event-processing/streambase-complex-event-processing

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jyoti Sahni .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Sahni, J., Vidyarthi, D.P. (2017). Scalable Online Analytics on Cloud Infrastructures. In: Singh, M., Gupta, P., Tyagi, V., Sharma, A., Ören, T., Grosky, W. (eds) Advances in Computing and Data Sciences. ICACDS 2016. Communications in Computer and Information Science, vol 721. Springer, Singapore. https://doi.org/10.1007/978-981-10-5427-3_43

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-5427-3_43

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-5426-6

  • Online ISBN: 978-981-10-5427-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics