Skip to main content

Cost Minimization for Big Data Processing in Geo-Distributed Data Centers

  • Chapter
  • First Online:
Cloud Networking for Big Data

Part of the book series: Wireless Networks ((WN))

Abstract

As we have known, cloud networking provides the possibility of orchestrating all resources towards different optimisation goals. For data transferring between the storage units and the processing units in big batch data (e.g., credit billing data) processing, SDN enables the programmers to customize the data routing as needed. Communication cost of large volume data transferring is non-ignorable and shall be carefully addressed in the consideration of cost efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. L. Gu, D. Zeng, P. Li, and S. Guo, “Cost minimization for big data processing in geo-distributed data centers,” Emerging Topics in Computing, IEEE Transactions on, vol. PP, no. 99, pp. 1–10, 2014.

    Google Scholar 

  2. “Data Center Locations,” http://www.google.com/about/datacenters/inside/locations/index.html.

  3. R. Raghavendra, P. Ranganathan, V. Talwar, Z. Wang, and X. Zhu, “No “Power”Struggles: Coordinated Multi-level Power Management for the Data Center,” in Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS). ACM, 2008, pp. 48–59.

    Google Scholar 

  4. L. Rao, X. Liu, L. Xie, and W. Liu, “Minimizing Electricity Cost: Optimization of Distributed Internet Data Centers in a Multi-Electricity-Market Environment,” in Proceedings of the 29th International Conference on Computer Communications (INFOCOM). IEEE, 2010, pp. 1–9.

    Google Scholar 

  5. Z. Liu, M. Lin, A. Wierman, S. H. Low, and L. L. Andrew, “Greening Geographical Load Balancing,” in Proceedings of International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS). ACM, 2011, pp. 233–244.

    Google Scholar 

  6. R. Urgaonkar, B. Urgaonkar, M. J. Neely, and A. Sivasubramaniam, “Optimal Power Cost Management Using Stored Energy in Data Centers,” in Proceedings of International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS). 2011, pp. 221–232.

    Google Scholar 

  7. B. L. Hong Xu, Chen Feng, “Temperature Aware Workload Management in Geo-distributed Datacenters,” in Proceedings of International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS). ACM, 2013, pp. 33–36.

    Google Scholar 

  8. S. A. Yazd, S. Venkatesan, and N. Mittal, “Boosting energy efficiency with mirrored data block replication policy and energy scheduler,” SIGOPS Oper. Syst. Rev., vol. 47, no. 2, pp. 33–40, 2013.

    Google Scholar 

  9. J. Dean and S. Ghemawat, “Mapreduce: simplified data processing on large clusters,” Communications of the ACM, vol. 51, no. 1, pp. 107–113, 2008.

    Google Scholar 

  10. I. Marshall and C. Roadknight, “Linking cache performance to user behaviour,” Computer Networks and ISDN Systems, vol. 30, no. 223, pp. 2123–2130, 1998.

    Google Scholar 

  11. H. Jin, T. Cheocherngngarn, D. Levy, A. Smith, D. Pan, J. Liu, and N. Pissinou, “Joint Host-Network Optimization for Energy-Efficient Data Center Networking,” in Proceedings of the 27th International Symposium on Parallel Distributed Processing (IPDPS), 2013, pp. 623–634.

    Google Scholar 

  12. R. Kaushik and K. Nahrstedt, “T*: A data-centric cooling energy costs reduction approach for Big Data analytics cloud,” in 2012 International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2012, pp. 1–11.

    Google Scholar 

  13. S. Gunduz and M. Ozsu, “A poisson model for user accesses to web pages,” in Computer and Information Sciences - ISCIS 2003, ser. Lecture Notes in Computer Science. Springer, vol. 2869, pp. 332–339, 2003.

    Google Scholar 

  14. L. Kleinrock, “The latency/bandwidth tradeoff in gigabit networks,” Communications Magazine, IEEE, vol. 30, no. 4, pp. 36–40, 1992.

    Google Scholar 

  15. “Gurobi,” www.gurobi.com.

    Google Scholar 

  16. G. Lee, J. Lin, C. Liu, A. Lorek, and D. Ryaboy, “The Unified Logging Infrastructure for Data Analytics at Twitter,” Proc. VLDB Endow., vol. 5, no. 12, pp. 1771–1780, 2012.

    Google Scholar 

  17. G. Mishne, J. Dalton, Z. Li, A. Sharma, and J. Lin, “Fast data in the era of big data: Twitter’s real-time related query suggestion architecture,” in Proceedings of the 2013 international conference on Management of data. ACM, pp. 1147–1158, 2013.

    Google Scholar 

  18. P. Bodík, I. Menache, M. Chowdhury, P. Mani, D. A. Maltz, and I. Stoica, “Surviving failures in bandwidth-constrained datacenters,” in Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication. ACM, pp. 431–442, 2012.

    Google Scholar 

  19. K. yin Chen, Y. Xu, K. Xi, and H. Chao, “Intelligent virtual machine placement for cost efficiency in geo-distributed cloud systems,” in Communications (ICC), 2013 IEEE International Conference on, pp. 3498–3503, 2013.

    Google Scholar 

  20. A. Greenberg, J. Hamilton, D. A. Maltz, and P. Patel, “The Cost of a Cloud: Research Problems in Data Center Networks,” SIGCOMM Comput. Commun. Rev., vol. 39, no. 1, pp. 68–73, Dec. 2008.

    Google Scholar 

  21. Y. Chen, S. Jain, V. Adhikari, Z.-L. Zhang, and K. Xu, “A first look at inter-data center traffic characteristics via yahoo! datasets,” in INFOCOM, 2011 Proceedings IEEE, IEEE, pp. 1620–1628, 2011.

    Google Scholar 

  22. K. You, B. Tang, Z. Qian, S. Lu, and D. Chen, “Qos-aware placement of stream processing service,” The Journal of Supercomputing, vol. 64, no. 3, pp. 919–941, 2013.

    Google Scholar 

  23. H. Ballani, K. Jang, T. Karagiannis, C. Kim, D. Gunawardena, and G. O’Shea, “Chatty Tenants and the Cloud Network Sharing Problem,” in Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation. USENIX Association, 2013, pp. 171–184.

    Google Scholar 

  24. K. LaCurts, S. Deng, A. Goyal, and H. Balakrishnan, “Choreo: network-aware task placement for cloud applications,” in Proceedings of the 2013 conference on Internet measurement conference. ACM, 2013, pp. 191–204.

    Google Scholar 

  25. C.-G. Lee and Z. Ma, “The generalized quadratic assignment problem,” Research Rep., Dept., Mechanical Industrial Eng., Univ. Toronto, Canada, 2004.

    Google Scholar 

  26. B. Chinoy and H.-W. Braun, “The National Science Foundation Network,” Technical Report GA-A21029, SDSC, Tech. Rep., 1992.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Zeng, D., Gu, L., Guo, S. (2015). Cost Minimization for Big Data Processing in Geo-Distributed Data Centers. In: Cloud Networking for Big Data. Wireless Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-24720-5_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24720-5_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24718-2

  • Online ISBN: 978-3-319-24720-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics