Skip to main content

A Technique to Identify Data Exchange Between Cloud Virtual Machines

  • Chapter
  • First Online:
Systems Modeling: Methodologies and Tools

Abstract

Modern cloud data centers typically exploit management strategies to reduce the overall energy consumption. While most of the solutions focus on the energy consumption due to computational elements, the optimization of network-related aspects of a data center is becoming more and more important, considering also the advent of the Software-Defined Network paradigm. However, an enabling step to implement network-aware Virtual Machine (VM) allocation is the knowledge of data exchange patterns. In this way we can place in well-connected hosts (or on the same physical host) the couples of VMs that exchange a large amount of information. Unfortunately, in Infrastructure as a Service data centers, a detailed knowledge on VMs data exchange is seldom available without the deployment of a specialized (and costly) monitoring infrastructure. In this paper, we propose a technique to infer VMs communication patterns starting from input/output network traffic time series of each VM. We discuss both the theoretical aspect of such technique and the design challenges for its implementation. A case study is used to demonstrate the viability of our idea.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://aws.amazon.com/cloudwatch/.

  2. 2.

    http://pandas.pydata.org/.

  3. 3.

    http://scikit-learn.org/.

  4. 4.

    www.tpc.org/tpcw/.

References

  1. M. Al-Fares, A. Loukissas, A. Vahdat, A scalable, commodity data center network architecture, in Proceedings of the ACM SIGCOMM 2008 Conference on Data Communication, SIGCOMM ’08, New York, NY (ACM, New York, 2008), pp. 63–74. http://doi.acm.org/10.1145/1402958.1402967

    Book  Google Scholar 

  2. E. Amigó, J. Gonzalo, J. Artiles, F. Verdejo, A comparison of extrinsic clustering evaluation metrics based on formal constraints. J. Inf. Retr. 12(4), 461–486 (2009)

    Article  Google Scholar 

  3. M. Andreolini, M. Colajanni, M. Pietri, A scalable architecture for real-time monitoring of large information systems, in Proceedings of IEEE Symposium on Network Cloud Computing and Applications, London (2012)

    Google Scholar 

  4. H. Ballani, P. Costa, T. Karagiannis, A. Rowstron, Towards predictable datacenter networks. ACM SIGCOMM Comput. Commun. Rev. 41(4), 242–253 (2011)

    Article  Google Scholar 

  5. A. Beloglazov, J. Abawajy, R. Buyya, Energy-aware resource allocation heuristics for efficient management of data centers for cloud computing. Futur. Gener. Comput. Syst. 28(5), 755–768 (2012)

    Article  Google Scholar 

  6. D. Boru, D. Kliazovich, F. Granelli, P. Bouvry, A.Y. Zomaya, Energy-efficient data replication in cloud computing datacenters. Clust. Comput. 18(1), 385–402 (2015)

    Article  Google Scholar 

  7. C. Canali, R. Lancellotti, Exploiting classes of virtual machines for scalable IaaS cloud management, in Proceedings of the 4th Symposium on Network Cloud Computing and Applications (NCCA) (2015)

    Google Scholar 

  8. C. Canali, R. Lancellotti, Identifying communication patterns between virtual machines in software-defined data centers. SIGMETRICS Perform. Eval. Rev. 44(4), 49–56 (2017)

    Article  Google Scholar 

  9. C. Canali, R. Lancellotti, M. Shojafar, A computation- and network-aware energy optimization model for virtual machines allocation, in Proceedings of International Conference on Cloud Computing and Services Science (CLOSER 2017), Porto (2017)

    Google Scholar 

  10. R. Castro, M. Coates, G. Liang, R. Nowak, B. Yu, Network tomography: recent developments. Stat. Sci. 19, 499–517 (2004)

    Article  MathSciNet  Google Scholar 

  11. W.H. Day, H. Edelsbrunner, Efficient algorithms for agglomerative hierarchical clustering methods. J. Classif. 1(1), 7–24 (1984)

    Article  Google Scholar 

  12. M. Dayarathna, Y. Wen, R. Fan, Data center energy consumption modeling: a survey. IEEE Commun. Surv. Tutorials 18(1), 732–794 (2016)

    Article  Google Scholar 

  13. P.T. Eugster, R. Guerraoui, A.M. Kermarrec, L. Massoulieacute, Epidemic information dissemination in distributed systems. Computer 37(5), 60–67 (2004). http://doi.ieeecomputersociety.org/10.1109/MC.2004.1297243

    Article  Google Scholar 

  14. B.J. Frey, D. Dueck, Clustering by passing messages between data points. Science 315(5814), 972–976 (2007)

    Article  MathSciNet  Google Scholar 

  15. M. Jelasity, A. Montresor, O. Babaoglu, Gossip-based aggregation in large dynamic networks. ACM Trans. Comput. Syst. 23(3), 219–252 (2005)

    Article  Google Scholar 

  16. J.P. Kowalski, B. Warfield, Modelling traffic demand between nodes in a telecommunications network, in Proceedings of ATNAC’95 (1995)

    Google Scholar 

  17. R. Lancellotti, C. Canali, A correlation-based methodology to infer communication patterns between cloud virtual machines, in Proceedings of the 10th EAI International Conference on Performance Evaluation Methodologies and Tools (VALUETOOLS), Taormina (2017), pp. 251–254

    Google Scholar 

  18. D. Li, N. Dai, F. Li, C. Xing, F. Dai, Estimating SDN traffic matrix based on online informative flow measurement method, in Proceedings of 2017 Fifth International Conference on Advanced Cloud and Big Data (CBD) (2017), pp. 75–80

    Google Scholar 

  19. U. Luxburg, A tutorial on spectral clustering. Stat. Comput. 17(4), 395–416 (2007)

    Article  MathSciNet  Google Scholar 

  20. A. Marotta, S. Avallone, A simulated annealing based approach for power efficient virtual machines consolidation, in Proceedings of 8th International Conference on Cloud Computing (CLOUD), IEEE (2015)

    Google Scholar 

  21. C. Mastroianni, M. Meo, G. Papuzzo, Probabilistic consolidation of virtual machines in self-organizing cloud data centers. IEEE Trans. Cloud Comput. 1(2), 215–228 (2013). https://doi.org/10.1109/TCC.2013.17

    Article  Google Scholar 

  22. X. Meng, V. Pappas, L. Zhang, Improving the scalability of data center networks with traffic-aware virtual machine placement, in Proceedings of the 29th Conference on Information Communications (INFOCOM), San Diego, CA (2010)

    Google Scholar 

  23. L. Myers, M.J. Sirois, Spearman Correlation Coefficients, Differences Between (Wiley, Hoboken, 2014). http://dx.doi.org/10.1002/9781118445112.stat02802

    Book  Google Scholar 

  24. K. Papagiannaki, N. Taft, A. Lakhina, A distributed approach to measure ip traffic matrices, in Proceedings of the 4th ACM SIGCOMM Conference on Internet Measurement (ACM, New York, 2004), pp. 161–174

    Google Scholar 

  25. J. Sonnek, J. Greensky, R. Reutiman, A. Chandra, Starling: minimizing communication overhead in virtualized computing platforms using decentralized affinity-aware migration, in Proceedings of 39th International Conference on Parallel Processing (ICPP), San Diego, CA (2010)

    Google Scholar 

  26. C. Tebaldi, M. West, Bayesian inference on network traffic using link count data. J. Am. Stat. Assoc. 93(442), 557–573 (1998)

    Article  MathSciNet  Google Scholar 

  27. M. Yu, L. Jose, R. Miao, Software defined traffic measurement with opensketch, in Presented as part of the 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13), USENIX, Lombard, IL (2013), pp. 29–42

    Google Scholar 

  28. L. Yuan, C.N. Chuah, P. Mohapatra, Progme: towards programmable network measurement. IEEE/ACM Trans. Netw. 19(1), 115–128 (2011)

    Article  Google Scholar 

  29. Y. Zhang, N. Ansari, Hero: hierarchical energy optimization for data center networks. IEEE Syst. J. 9(2), 406–415 (2013)

    Article  Google Scholar 

Download references

Acknowledgements

The authors acknowledge the support of the project S 2 C: Secure Software-defined Cloud funded by the University of Modena and Reggio Emilia.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Riccardo Lancellotti .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer International Publishing AG, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Bicocchi, N., Canali, C., Lancellotti, R. (2019). A Technique to Identify Data Exchange Between Cloud Virtual Machines. In: Puliafito, A., Trivedi, K. (eds) Systems Modeling: Methodologies and Tools. EAI/Springer Innovations in Communication and Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-92378-9_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-92378-9_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-92377-2

  • Online ISBN: 978-3-319-92378-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics