Stability Analysis of a Statistical Model for Cloud Resource Management

  • Mitalee SarkerEmail author
  • Stefan Wesner
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11819)


In this paper, we presented a comprehensive stability analysis of statistical models derived from the network usage data to design an efficient and optimal resource management in a Cloud data centre. In recent years, it has been noticed that network has a significant impact on the HPC and business critical applications when they are run in a cloud environment. The existing VM placement algorithms lack capabilities to deploy such applications in an effective way and cause performance degradation. As a result, there is an urge for a network-aware VM placement algorithm which will consider the application behaviour and system capability. Our approach uses static models based on simple probability distribution concept and partition (number theory) to characterise and predict the resource usage behaviour of the VMs. However, the stability of those models is a key requirement to ensure a persistent placement of the VMs which can prevent their frequent migration and keep the infrastructure rigid. The paper investigates the stability of the models with respect to time. Sticky HDP-HMM method was proven highly capable to model the monitoring data with a certain accuracy. The refined data was further used to estimate the resource consumption of each VM and physical host running in the infrastructure. A stability parameter has been defined to determine the level of steadiness of the models that gives us a clear indication on whether the models can be used further to derive an optimal placement decision for new VMs. The paper ends with a discussion on instance based stability analysis and future work.


Cloud data centre Network VM placement 



The research leading to these results has received funding from the EC’s Framework Programme HORIZON 2020 under grant agreement number 732258 (CloudPerfect).


  1. 1.
    Adegboyega, A.: Time-series models for cloud workload prediction: a comparison. In: 2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM), pp. 298–307. IEEE (2017)Google Scholar
  2. 2.
    Azure, M.: High-performance computing. Accessed 24 May 2019
  3. 3.
    Balaji, P., Naik, H., Desai, N.: Understanding network saturation behavior on large-scale blue gene/p systems. In: 2009 15th International Conference on Parallel and Distributed Systems, pp. 586–593. IEEE (2009)Google Scholar
  4. 4.
    Barr, J.: New - predictive scaling for EC2, powered by machine learning. Accessed 25 May 2019
  5. 5.
    Oracle Cloud: HPC on oracle cloud infrastructure. Accessed 25 May 2019
  6. 6.
    Fox, E.B., Sudderth, E.B., Jordan, M.I., Willsky, A.S.: The sticky HDP-HMM: Bayesian nonparametric hidden Markov models with persistent states. Arxiv preprint (2007)Google Scholar
  7. 7.
    Fuerst, C., Schmid, S., Suresh, L., Costa, P.: Kraken: online and elastic resource reservations for cloud datacenters. IEEE/ACM Trans. Network. (TON) 26(1), 422–435 (2018)CrossRefGoogle Scholar
  8. 8.
    Ghiasi, A., Baca, R.: Overview of largest data centers, May 2014. Accessed 19 Apr 2018
  9. 9.
    Heyman, D.P., Tabatabai, A., Lakshman, T.: Statistical analysis and simulation study of video teleconference traffic in ATM networks. IEEE Trans. Circuits Syst. Video Technol. 2(1), 49–59 (1992)CrossRefGoogle Scholar
  10. 10.
    Hubner, F., Tran-Gia, P.: Quasi-stationary analysis of a finite capacity asynchronous multiplexer with modulated deterministic input. ITC-13, Copenhagen (1991)Google Scholar
  11. 11.
    Mehrotra, P., et al.: Performance evaluation of Amazon EC2 for NASA HPC applications. In: Proceedings of the 3rd Workshop on Scientific Cloud Computing, pp. 41–50. ACM (2012)Google Scholar
  12. 12.
    Mouchet, M.: Statistical characterisation of RTT series. Accessed 27 May 2019
  13. 13.
    Nolan, J.: Stable Distributions: Models for Heavy-Tailed Data. Birkhauser, New York (2003)CrossRefGoogle Scholar
  14. 14.
    OpenStackCommunity: Openstack compute schedulers. Accessed 06 June 2018
  15. 15.
    Popescu, D.A., Zilberman, N., Moore, A.W.: Characterizing the impact of network latency on cloud-based applications’ performance (2017)Google Scholar
  16. 16.
    Sarker, M., Wesner, S.: Statistical model based cloud resource management. In: Coppola, M., Carlini, E., D’Agostino, D., Altmann, J., Bañares, J.Á. (eds.) GECON 2018. LNCS, vol. 11113, pp. 107–115. Springer, Cham (2019). Scholar
  17. 17.
    Amazon Web Services: High performance computing (HPC). Accessed 24 May 2019
  18. 18.
    Son, J., Buyya, R.: Priority-aware VM allocation and network bandwidth provisioning in software-defined networking (SDN)-enabled clouds. IEEE Trans. Sustain. Comput. 4, 17–28 (2018)CrossRefGoogle Scholar
  19. 19.
    Teh, Y.W., Jordan, M.I., Beal, M.J., Blei, D.M.: Hierarchical Dirichlet processes. J. Am. Stat. Assoc. 101 (2004)MathSciNetCrossRefGoogle Scholar
  20. 20.
    Witt, C., Bux, M., Gusew, W., Leser, U.: Predictive performance modeling for distributed computing using black-box monitoring and machine learning. arXiv preprint arXiv:1805.11877 (2018)
  21. 21.
    Yildirim, I.: Bayesian inference: Gibbs sampling. Technical Note, University of Rochester (2012)Google Scholar
  22. 22.
    Yu, L., Shen, H., Cai, Z., Liu, L., Pu, C.: Towards bandwidth guarantee for virtual clusters under demand uncertainty in multi-tenant clouds. IEEE Trans. Parallel Distrib. Syst. 29(2), 450–465 (2017)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Institute of Information Resource ManagementUlm UniversityUlmGermany

Personalised recommendations