Skip to main content

Group-Wise Performance Evaluation of Processor Co-allocation in Multi-cluster Systems

  • Conference paper
Job Scheduling Strategies for Parallel Processing (JSSPP 2007)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4942))

Included in the following conference series:

Abstract

Performance evaluation in multi-cluster processor co-allocation - like in many other parallel job scheduling problems- is mostly done by computing the average metric value for the entire job stream. This does not give a comprehensive understanding of the relative performance of the different jobs grouped by their characteristics. It is however the characteristics that affect how easy/hard jobs are to schedule. We, therefore, do not get to understand scheduler performance at job type level. In this paper, we study the performance of multi-cluster processor co-allocation for different job groups grouped by their size, components and widest component. We study their relative performance, sensitivity to parameters and how their performance is affected by the heuristics used to break them up into components. We show that the widest component us characteristic that most affects job schedulability. We also show that to get better performance, jobs should be broken up in such a way that the width of the widest component is minimized.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bal, et al.: The Distributed ASCI Supercomputer Project. Operating Systems Review 34(4), 76–96 (2000)

    Article  Google Scholar 

  2. Bucur, A.I.D., Epema, D.H.J.: The Influence of the Structure and Sizes of Jobs on the Performance of Co-allocation. In: Feitelson, D.G., Rudolph, L. (eds.) IPDPS-WS 2000 and JSSPP 2000. LNCS, vol. 1911, pp. 154–173. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  3. Bucur, A.I.D., Epema, D.H.J.: The Performance of Processor Co-allocation in Multicluster Systems. In: proceedings of the 3rd IEEE/ ACM International Symposium on Cluster Computing and the Grid (CCGrid 2003), pp. 302–309 (2003)

    Google Scholar 

  4. Bucur, A.I.D.: Performance Analysis of Processor Co-allocation in Multicluster Systems. PhD Thesis, Delft University of Technology, Delft, The Netherlands (2004)

    Google Scholar 

  5. Bucur, A.I.D., Epema, D.H.J.: Local versus Global Schedulers with Processor Co-allocation in Multicluster Systems. In: Feitelson, D.G., Rudolph, L., Schwiegelshohn, U. (eds.) JSSPP 2002. LNCS, vol. 2537, pp. 184–204. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  6. Chiang, S.H., Arpaci-Dusseau, A., Vernon, M.K.: The Impact of More Accurate Requested Runtimes on Production Job Scheduling Performance. In: Feitelson, D.G., Rudolph, L., Schwiegelshohn, U. (eds.) JSSPP 2002. LNCS, vol. 2537, pp. 103–127. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  7. Czajkowski, K., Foster, I.T., Kasselman, C.: Resource Co-allocation in Computational Grids. In: Proceedings of the 8th IEEE International Symposium on High Performance and Distributed Computing, California, USA, pp. 37–47 (1999)

    Google Scholar 

  8. Edmonds, J.: Scheduling in the Dark. In: Proceedings of the 31st Annual ACM Symposium on Theory of Computing, pp. 179–188 (1999)

    Google Scholar 

  9. Feitelson, D.G., Rudolph, L., Schweigelshohn, U., Sevcik, K., Wong, P.: Theory and Practice in Parallel Job Scheduling. In: Feitelson, D.G., Rudolph, L. (eds.) IPPS-WS 1997 and JSSPP 1997. LNCS, vol. 1291, pp. 1–34. Springer, Heidelberg (1997)

    Google Scholar 

  10. Feitelson, D.G., Rudolph, L., Schwiegelshohn, U.: Parallel Job Scheduling - A Status Report. In: Feitelson, D.G., Rudolph, L., Schwiegelshohn, U. (eds.) JSSPP 2004. LNCS, vol. 3277, pp. 1–16. Springer, Heidelberg (2005)

    Google Scholar 

  11. Feitelson, D.G., Rudolph, L.: Towards Convergence of Job Schedulers for Parallel Supercomputers. In: Feitelson, D.G., Rudolph, L. (eds.) IPPS-WS 1996 and JSSPP 1996. LNCS, vol. 1162, pp. 1–26. Springer, Heidelberg (1996)

    Chapter  Google Scholar 

  12. Feitelson, D.G.: Metric and Workload Effects on Computer Systems Evaluation. Computers 18, 18–25 (2003)

    Article  Google Scholar 

  13. Foster, I., Kasselman, C.: The Grid: Blue Print for a New Computing Infrastructure. Morgan Kaufmann, San Francisco, CA, USA (1999)

    Google Scholar 

  14. Frachtenberg, E., Feitelson, D.G.: Pitfalls in Parallel Job Scheduling Evaluation. In: Feitelson, D.G., Frachtenberg, E., Rudolph, L., Schwiegelshohn, U. (eds.) JSSPP 2005. LNCS, vol. 3834, pp. 257–282. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  15. Jones, W.M.: Improving Parallel Job Scheduling Performance in Multi-clusters Through Selective Job Co-allocation, PhD dissertation, Clemson University, Clemson, South Carolina, USA (2005)

    Google Scholar 

  16. Lee, C.B., Schwartzman, Y., Hardy, J., Snavely, A.: Are User Runtime Estimates Inherently Inaccurate? In: Feitelson, D.G., Rudolph, L., Schwiegelshohn, U. (eds.) JSSPP 2004. LNCS, vol. 3277, pp. 253–263. Springer, Heidelberg (2005)

    Google Scholar 

  17. Li, H., Groep, D., Walters, L.: Workload Characteristics of a Multi-cluster Supercomputer. In: Feitelson, D.G., Rudolph, L., Schwiegelshohn, U. (eds.) JSSPP 2004. LNCS, vol. 3277, pp. 176–193. Springer, Heidelberg (2005)

    Google Scholar 

  18. Lifka, L.: The ANL/IBM SP Scheduling System. In: Feitelson, D.G., Rudolph, L. (eds.) IPPS-WS 1995 and JSSPP 1995. LNCS, vol. 949, pp. 295–303. Springer, Heidelberg (1995)

    Google Scholar 

  19. Moreira, J., Pattnaik, P., Franke, H., Jann, J.: An Evaluation of Parallel Job Scheduling for ASCI Blue-Pacific. In: Proceedings of the IEEE/ACM Supercomputing Conference SC 1999. Portland, Oregon, USA (1999)

    Google Scholar 

  20. Mualem, A.W., Feitelson, D.G.: Utilization, Predictability, Workloads and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling. IEEE Transactions in Parallel and Distributed Systems 12(6), 529–543 (2001)

    Article  Google Scholar 

  21. Shmueli, E., Feitelson, D.G.: Backfilling with Lookahead to Optimize the Performance of Parallel Job Scheduling. In: Feitelson, D.G., Rudolph, L., Schwiegelshohn, U. (eds.) JSSPP 2003. LNCS, vol. 2862, pp. 228–251. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  22. Srinivasan, S., Krishnamoorthy, S., Sadayappan, P.: A Robust Scheduling Technology for Moldable Scheduling of Parallel Jobs. In: Proceedings of the IEEE International Conference on Cluster Computing, pp. 92–99 (2003)

    Google Scholar 

  23. Strohmaier, E., Dongarra, J.J., Meuer, H.W., Simon, D.: Recent Trends in the Marketplace of High Performance Computing. Journal of Parallel Computing 31, 261–273 (2005)

    Article  Google Scholar 

  24. Srinivasan, S., Kettimuthu, R., Subramani, V., Sadayappan, P.: Selective Reservation Strategies for Backfill Job Scheduling. In: Feitelson, D.G., Rudolph, L., Schwiegelshohn, U. (eds.) JSSPP 2002. LNCS, vol. 2537, pp. 55–71. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  25. Srinivasan, S., Kettimuthu, R., Subramani, V., Sadayappan, P.: Characterization of backfilling Strategies for Parallel Job Scheduling. In: Proceedings of the 2002 International Conference on Parallel Processing Workshops, pp. 514–520 (2002)

    Google Scholar 

  26. The Distributed ASCI Supercomputer, http://www.cs.vu.nl/das2

  27. The Global Grid Forum, http://www.gridforum.com

  28. The Mesquite Software inc: The CSIM 18 Simulation Engine Users Guide

    Google Scholar 

  29. The Parallel Workloads Archive, http://www.cs.huji.ac.il/labs/parallel/workload/logs.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Eitan Frachtenberg Uwe Schwiegelshohn

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ngubiri, J., van Vliet, M. (2008). Group-Wise Performance Evaluation of Processor Co-allocation in Multi-cluster Systems. In: Frachtenberg, E., Schwiegelshohn, U. (eds) Job Scheduling Strategies for Parallel Processing. JSSPP 2007. Lecture Notes in Computer Science, vol 4942. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78699-3_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-78699-3_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78698-6

  • Online ISBN: 978-3-540-78699-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics