An Architecture for Automatic Scaling of Replicated Services

Aniello, Leonardo; Bonomi, Silvia; Lombardi, Federico; Zelli, Alessandro; Baldoni, Roberto

doi:10.1007/978-3-319-09581-3_9

Part of the book series: Lecture Notes in Computer Science (LNCCN, volume 8593)

Included in the following conference series:

International Conference on Networked Systems

Abstract

Replicated services that can scale dynamically are able to adapt to the request load. Choosing the right number of replicas is fundamental to avoid performance degradation when input spikes occur and to save resources when the load is low. Current mechanisms for automatic scaling are mostly based on fixed thresholds on CPU and memory usage, which are not sufficiently accurate and often trigger countermeasures too late. We propose Make Your Service Elastic (MYSE), an architecture for automatic scaling of generic replicated services based on queuing models for accurate response time estimation. Request and service time patterns are analyzed to learn and predict their distributions over time, so as to allow for early scaling. A novel heuristic is proposed to avoid the flipping phenomenon. We carried out simulations that show promising results regarding the effectiveness of our approach.
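
As an illustration of how a queuing model can drive the choice of replica count, the following is a minimal sketch and not the MYSE implementation. The abstract does not state which queuing model is used; the sketch assumes an M/M/c queue (Poisson arrivals, exponential service times, c identical replicas) and the Erlang C formula. The function names and the example rates are hypothetical.

```python
import math


def erlang_c(replicas: int, offered_load: float) -> float:
    """Probability that a request has to wait in an M/M/c queue.

    offered_load is arrival_rate / service_rate (assumed M/M/c model).
    """
    if offered_load >= replicas:
        return 1.0  # unstable system: every request waits
    # Sum of a^k / k! for k = 0 .. c-1
    partial = sum(offered_load ** k / math.factorial(k) for k in range(replicas))
    tail = (offered_load ** replicas / math.factorial(replicas)) * (
        replicas / (replicas - offered_load)
    )
    return tail / (partial + tail)


def mean_response_time(replicas: int, arrival_rate: float, service_rate: float) -> float:
    """Mean response time (queueing delay plus service time) for M/M/c."""
    offered_load = arrival_rate / service_rate
    if offered_load >= replicas:
        return math.inf  # the queue grows without bound
    waiting = erlang_c(replicas, offered_load) / (
        replicas * service_rate - arrival_rate
    )
    return waiting + 1.0 / service_rate


def min_replicas(arrival_rate, service_rate, target, max_replicas=1000):
    """Smallest replica count whose estimated mean response time meets target.

    Returns None if no count up to max_replicas satisfies the target.
    """
    for replicas in range(1, max_replicas + 1):
        if mean_response_time(replicas, arrival_rate, service_rate) <= target:
            return replicas
    return None


if __name__ == "__main__":
    # Hypothetical load: 80 requests/s, each replica serves 10 requests/s,
    # and the target mean response time is 150 ms.
    print(min_replicas(arrival_rate=80.0, service_rate=10.0, target=0.15))
```

Under these assumptions the estimate depends on the predicted arrival and service rates rather than on a fixed CPU or memory threshold, so a forecast load increase can be translated into a replica count before the spike arrives.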

Acknowledgments

This work has been partially supported by the TENACE PRIN Project (n. 20103P34XC) funded by the Italian Ministry of Education, University and Research and by the academic project C26A133HZY funded by the University of Rome “La Sapienza”.

Author information

Authors and Affiliations

Cyber Intelligence and Information Security Research Center, University of Rome “La Sapienza”, Via Ariosto 25, 00185, Rome, Italy
Leonardo Aniello, Silvia Bonomi, Federico Lombardi & Roberto Baldoni
Department of Computer, Control, and Management Engineering “Antonio Ruberti”, University of Rome “La Sapienza”, Via Ariosto 25, 00185, Rome, Italy
Leonardo Aniello, Silvia Bonomi, Federico Lombardi, Alessandro Zelli & Roberto Baldoni

Corresponding author

Correspondence to Leonardo Aniello.

Editor information

Editors and Affiliations

Northeastern University, Boston, Massachusetts, USA
Guevara Noubir
Université de Rennes 1, Rennes Cedex, France
Michel Raynal

Cite this paper

Aniello, L., Bonomi, S., Lombardi, F., Zelli, A., Baldoni, R. (2014). An Architecture for Automatic Scaling of Replicated Services. In: Noubir, G., Raynal, M. (eds) Networked Systems. NETYS 2014. Lecture Notes in Computer Science, vol 8593. Springer, Cham. https://doi.org/10.1007/978-3-319-09581-3_9

DOI: https://doi.org/10.1007/978-3-319-09581-3_9
Published: 03 August 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09580-6
Online ISBN: 978-3-319-09581-3
eBook Packages: Computer Science, Computer Science (R0)
