Group-Wise Performance Evaluation of Processor Co-allocation in Multi-cluster Systems

  • John Ngubiri
  • Mario van Vliet
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4942)

Abstract

Performance evaluation in multi-cluster processor co-allocation, as in many other parallel job scheduling problems, is mostly done by computing the average metric value over the entire job stream. This does not give a comprehensive understanding of the relative performance of jobs grouped by their characteristics, even though it is these characteristics that determine how easy or hard a job is to schedule. Scheduler performance at the level of individual job types therefore remains poorly understood. In this paper, we study the performance of multi-cluster processor co-allocation for job groups defined by size, number of components and widest component. We examine their relative performance, their sensitivity to scheduler parameters and how their performance is affected by the heuristics used to break jobs up into components. We show that the widest component is the characteristic that most affects job schedulability. We also show that, for better performance, jobs should be broken up in such a way that the width of the widest component is minimized.
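
For illustration only, here is a minimal sketch (in Python, not taken from the paper) of a break-up rule that follows this recommendation: spread the job's processors as evenly as possible over its components, so that the widest component is as narrow as it can be. The function name and interface are hypothetical.

    def break_job(size, num_components):
        """Split a co-allocated job of `size` processors into `num_components`
        components so that the widest component is as narrow as possible.
        A balanced split achieves this: every component gets either
        floor(size / num_components) or ceil(size / num_components) processors.
        Hypothetical helper, shown only to illustrate the break-up criterion.
        """
        base, rest = divmod(size, num_components)
        return [base + 1] * rest + [base] * (num_components - rest)

    # Example: a 14-processor job split into 4 components.
    # The widest component has 4 processors, the minimum achievable width.
    print(break_job(14, 4))  # [4, 4, 3, 3]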

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • John Ngubiri (1)
  • Mario van Vliet (1)
  1. Nijmegen Institute for Informatics and Information Science, Radboud University Nijmegen, Nijmegen, The Netherlands