Skip to main content

Service Level Agreements for Job Control in Grid and High-Performance Computing

  • Chapter
  • First Online:

Part of the book series: Computer Communications and Networks ((CCN))

Abstract

Service Level Agreements (SLAs) are electronic contracts that are used to describe service levels for a plethora of tasks and situations, regardless of them being consumed offline or online. SLAs are being investigated already for a long time in the area of grid computing and, as well, by classical High-Performance Computing (HPC) providers. Most often, these investigations are either only on a high logical level above or at the middleware or on a low physical level below the middleware. In the first case, components are at best placed in the middleware layer but are not directly communicating with lower level resources; in the second case, SLAs are only used below the middleware and are not visible above. This work presents an approach for a solution to job submission and scheduling, called job control, using SLAs as long-term contracts in an integrated fashion across layers.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    For example, at the German high-performance computing centers in Jülich, Munich, and Stuttgart.

  2. 2.

    The High-Performance Computing Center Stuttgart, as well as the infrastructure provided by the German grid initiative D-Grid is, for example, still using the Globus Toolkit version 4.0, although this has been superseded by version 4.2 in February 2008 and version 5.0 in January 2010.

References

  1. Amazon Web Services LLC: Amazon EC2 SLA. Retrieved from http://aws.amazon.com/ec2-sla (2008)

  2. Andrieux, A., Czajkowski, K., Dan, A., Keahey, K., Ludwig, H., Nakata, T., Pruyne, J., Rofrano, J., Tuecke, S., Xu, M.: Web services agreement specification (WS-Agreement). OpenGridForum, pp. 1–81. Retrieved from https://forge.gridforum.org/sf/go/doc14574?nav=1 (2007)

  3. Anjomshoaa, A., Brisard, F., Drescher, M., Fellows, D., Ly, A., McGough, S., Pulsipher, D., Sawa, A.: Job Submission Description Language (JSDL) specification, version 1.0. OpenGridForum, pp. 1–72. Retrieved from http://www.gridforum.org/documents/GFD.56.pdf (2005)

  4. Argonne National Laboratories: OpenPBS Public Home. Retrieved from http://www.mcs.anl.gov/research/projects/openpbs (2010)

  5. Boniface, M., Phillips, S.C., Surridge, M.: Grid-based business partnerships using service level agreements. In: Proceedings of Cracow Grid Workshop, Cracow, pp. 16–18 (2006)

    Google Scholar 

  6. Chakrabarti, A.: Grid Computing Security. Springer, New York (2007)

    MATH  Google Scholar 

  7. Cluster Resources Inc.: TORQUE resource manager. Retrieved from http://www.clusterresources.com/products/torque-resourcemanager.php (2010)

  8. De Saint-Exupery, A.: Terre des hommes. Gallimard Education, Paris (1999)

    Google Scholar 

  9. Djemame, K., Gourlay, I., Padgett, J., Birkenheuer, G., Hovestadt, M., Kao, O., Voss, K.: Introducing risk management into the grid. In: Proceedings of the 2nd IEEE International Conference on E-Science and Grid Computing, Amsterdam (2006). doi:10.1109/E-SCIENCE.2006.92

  10. Dumitrescu, C., Raicu, I., Foster, I.: The design, usage, and performance of GRUBER: A GRid Usage Service Level Agreement based BrokERing Infrastructure. J. Grid Comput. 5(1), 99–126 (2005a)

    Article  Google Scholar 

  11. Dumitrescu, C., Raicu, I., Foster, I.: DI-GRUBER: a distributed approach to grid resource brokering. In: Proceedings of the ACM/IEEE 2005 Conference on Supercomputing, Seattle, p. 38 (2005b). doi:10.1109/SC.2005.23

  12. Dumitrescu, C.L., Raicu, I., Foster, I.: Usage SLA-based scheduling in grids: Research articles. Concurr. Comput. Pract. Exp. 19(7), 945–963 (2007)

    Article  Google Scholar 

  13. Ernest, L.M., Hamilton, R.A., Moraca, W.L., Seaman, J.W.: Managing compliance with service level agreements in a grid environment (No. 7668741). Retrieved from http://www.freepatentsonline.com/7668741.html (2010)

  14. Fawaz, W., Daheb, B., Audouin, O., Du-Pond, M., Pujolle, G.: Service Level Agreement and provisioning in optical networks. IEEE Commun. Mag. 42(1), 36–43 (2004)

    Article  Google Scholar 

  15. Graham, S., Hull, D., Murray, B.: Web services base notification 1.3. Retrieved from http://docs.oasis-open.org/wsn/wsn-ws_base_notification-1.3-spec-os.pdf (2006)

  16. Hayes, I.S.: Metrics for IT outsourcing Service Level Agreements. Clarity Consulting Publications. Retrieved from http://www.clarity-consulting.com/MetricsforIToutsourcing.pdf (2004)

  17. HPC4U (Highly Predictable Cluster for Internet-Grids) Consortium: HPC4U project home page. Retrieved from http://www.hpc4u.org (2006)

  18. Intel Inc.: Single-chip cloud computer. Retrieved from http://techresearch.intel.com/UserFiles/en-us/File/terascale/SCC-Overview.pdf (2009)

  19. IRMOS (Interactive Realtime Multimedia Applications on Service-Oriented Infrastructures) Consortium: IRMOS project home page. Retrieved from http://irmos-project.eu (2010)

  20. Keller, A., Ludwig, H.: The WSLA framework: Specifying and monitoring service level agreements for web services. J. Netw. Syst. Manage. 11(1), 57–81 (2003)

    Article  Google Scholar 

  21. Lamanna, D.D, Skene, J., Emmerich, W.: Slang: a language for defining service level agreements. In: Proceedings of the 9th IEEE Workshop on Future Trends of Distributed Computing Systems, San Juan, pp 100–106 (2003)

    Google Scholar 

  22. Lupton, G., Thulin, D.: Accelerating HPC using GPU’s. Retrieved from http://www.hp.com/techservers/hpccn/hpccollaboration/ADCatalyst/downloads/accelerating_HPC_Using_GPU%27s.pdf (2008)

  23. Lux Scientiae Inc.: Service level agreement for shared email & web hosting. Retrieved from http://luxsci.com/extranet/shared-sla.html (2010)

  24. MacLaren, J., Sakellariou, R., Krishnakumar, K.T., Garibaldi, J., Ouelhadj, D.: Towards service level agreement based scheduling on the grid. In: Proceedings of the 2nd European across Grids Conference, Nicosia, pp. 100–102 (2004)

    Google Scholar 

  25. Mahler, T., Arenas, A., Schubert, L.: Contractual frameworks for enterprise networks and virtual organizations in e-learning. In: Cunnigham, P., Cunningham, M. (eds.) Exploiting the Knowledge Economy: Issues, Applications, Case Studies, vol. 2. IOS Press, Amsterdam (2006)

    Google Scholar 

  26. Mangipudi, K., Basani, V.: Method and apparatus for implementing a service level agreement (No. 7058704). Retrieved from http://www.freepatentsonline.com/7058704.html (2006)

  27. Microsoft Corporation: Download details: Windows Azure Compute SLA document. Retrieved from http://go.microsoft.com/fwlink/?LinkId=159704 (2010)

  28. New Jersey City University, Department of Information Technology: Policy & procedures: Help desk Service Level Agreement (SLA). Retrieved from http://www.njcu.edu/dept/it/documents/helpdesk_sla-employees_sp08.pdf (2008)

  29. NextGRID Consortium: NextGRID project home page. Retrieved from http://nextgrid.org (2008)

  30. Raimondi, F., Skene, J., Emmerich, W.: Efficient online monitoring of web-service SLAs. In: Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of Software Engineering, Atlanta, pp. 170–180 (2008)

    Google Scholar 

  31. Resch, M: Entgeltordnung für die nutzung der rechenanlagen und peripheren geräte des höchstleistungsrechenzentrums Stuttgart (HLRS) an der Universität Stuttgart. Retrieved from http://www.hlrs.de/fileadmin/_assets/organization/sos/puma/services/Entgeltordnungen/Entgeltordnung_16-09-2008.pdf (2008)

  32. Saha, S., Biswas, A., Sen, S.: Modeling opponent decision in repeated one-shot negotiations. In: Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems, Utrecht, pp. 397–403 (2005)

    Google Scholar 

  33. Sakellariou, R., Yarmolenko, V.: On the flexibility of WS-agreement for job submission. In: Proceedings of the 3rd International Workshop on Middleware for Grid Computing, vol. 117, Grenoble, pp. 1–6 (2005)

    Google Scholar 

  34. Sakellariou, R., Yarmolenko, V.: Job scheduling on the grid: towards SLA-based scheduling. In: Grandinetti, L. (ed.) High Performance Computing and Grids in Action. IOS Press, Amsterdam (2008). Retrieved 03 Mar 2010 from http://www.cs.man.ac.uk/∼rizos/papers/hpc08.pdf

    Google Scholar 

  35. Sandholm, T.: Service level agreement requirements of an accounting-driven computational grid. Tech. Rep. No. TRITA-NA-0533. Royal Institute of Technology, Stockholm (2005)

    Google Scholar 

  36. Seidel, J., Wäldrich, O., Wieder, P., Yahyapour, R., Ziegler, W.: Using SLA for resource management and scheduling – a survey. In: Talia, D., Yahyapour, R., Ziegler, W. (eds.) Grid Middleware and Services – Challenges and Solutions. Springer, New York (2008)

    Google Scholar 

  37. Siebenlist, F., Welch, V.: The Globus toolkit authorization framework. Retrieved from http://www.globus.org/alliance/events/sc06/AuthZ.pdf (2006)

  38. University of California, Los Angeles: Cts call center service level agreement. Retrieved from http://www.cts.ucla.edu/pdfs/CallCenterSLA.pdf (2010)

  39. Wesner, S.: Integrated management framework for dynamic virtual organizations. Dissertation, Universität Stuttgart, Stuttgart (2008)

    Google Scholar 

  40. Yarmolenko, V., Sakellariou, R.: An evaluation of heuristics for SLA based parallel job scheduling. In: Proceedings of the 20th International Parallel and Distributed Processing Symposium, Rhodes Island, p. 8 (2006)

    Google Scholar 

  41. Yeo, C.S., Buyya, R.: Managing risk of inaccurate runtime estimates for deadline constrained job admission control in clusters. In: Proceedings of the 2006 International Conference on Parallel Processing, Columbus, pp. 451–458 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Roland Kübert .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag London Limited

About this chapter

Cite this chapter

Kübert, R. (2011). Service Level Agreements for Job Control in Grid and High-Performance Computing. In: Preve, N. (eds) Grid Computing. Computer Communications and Networks. Springer, London. https://doi.org/10.1007/978-0-85729-676-4_8

Download citation

  • DOI: https://doi.org/10.1007/978-0-85729-676-4_8

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-0-85729-675-7

  • Online ISBN: 978-0-85729-676-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics