
Decision Trees and MPI Collective Algorithm Selection Problem

  • Jelena Pješivac-Grbović
  • George Bosilca
  • Graham E. Fagg
  • Thara Angskun
  • Jack J. Dongarra
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4641)

Abstract

Selecting a close-to-optimal collective algorithm at run time, based on the parameters of the collective call, is an important step toward achieving good performance in MPI applications. In this paper, we explore the applicability of C4.5 decision trees to the MPI collective algorithm selection problem. We construct C4.5 decision trees from measured algorithm performance data and analyze both the properties of the resulting trees and the expected run-time performance penalty.
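
The construction step described above can be pictured with a short, hedged sketch. The snippet below is not the authors' tool chain: the measurement values and algorithm indices are made up, and scikit-learn's DecisionTreeClassifier implements CART rather than C4.5, so it only stands in for the C4.5 trees used in the paper.

```python
# Minimal sketch, under the assumptions stated above: label each measured
# (communicator size, message size) point with its fastest algorithm and
# fit a decision tree to those labels.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Hypothetical benchmark results: (comm_size, msg_size, algorithm_id, time_us).
measurements = np.array([
    [8,       1024, 0,   35.0],   # algorithm 0: e.g. binomial-tree broadcast
    [8,       1024, 1,   52.0],   # algorithm 1: e.g. pipelined broadcast
    [8,    1048576, 0,  910.0],
    [8,    1048576, 1,  640.0],
    [64,      1024, 0,   80.0],
    [64,      1024, 1,  150.0],
    [64,   1048576, 0, 4200.0],
    [64,   1048576, 1, 2500.0],
])

# For every (comm_size, msg_size) pair keep only the fastest algorithm.
features, labels = [], []
for comm_size, msg_size in {(row[0], row[1]) for row in measurements.tolist()}:
    rows = measurements[(measurements[:, 0] == comm_size) &
                        (measurements[:, 1] == msg_size)]
    features.append([comm_size, msg_size])
    labels.append(int(rows[np.argmin(rows[:, 3]), 2]))

# max_leaf_nodes loosely plays the role of pruning: it trades tree size
# against accuracy, much as the paper does when it reports a 21-leaf tree.
tree = DecisionTreeClassifier(max_leaf_nodes=21).fit(features, labels)
print(tree.predict([[64, 1048576]]))      # index of the selected algorithm
```

At run time only the learned tree (or a decision function generated from it) would be queried, so a selection costs just a handful of comparisons on the communicator and message size.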

In the cases we considered, the results show that C4.5 decision trees can be used to generate a reasonably small and very accurate decision function. For example, a broadcast decision tree with only 21 leaves was able to achieve a mean performance penalty of 2.08%. Similarly, combining the experimental data for reduce and broadcast and generating a decision function from the combined decision trees resulted in less than 2.5% relative performance penalty. These results indicate that C4.5 decision trees are applicable to this problem and could be used more widely in this domain.
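
One way to read the penalty figures quoted above is sketched below. It assumes the relative performance penalty compares the time of the algorithm the decision function selects against the exhaustively measured best time at each test point; this follows the abstract's wording but is our paraphrase, not code from the paper, and the numbers are hypothetical.

```python
# Minimal sketch, under the assumption stated above: relative performance
# penalty of a decision function versus the measured optimum.
import numpy as np

def relative_penalty_percent(times, chosen):
    """times: (n_points, n_algorithms) measured durations per test point;
    chosen: index of the algorithm the decision function selects at each point."""
    t_best = times.min(axis=1)                          # exhaustive optimum
    t_selected = times[np.arange(len(times)), chosen]   # what was actually run
    return 100.0 * (t_selected - t_best) / t_best

# Hypothetical measured times (microseconds) for two broadcast algorithms.
times = np.array([[  35.0,   52.0],
                  [ 910.0,  640.0],
                  [  80.0,  150.0],
                  [4200.0, 2500.0]])
chosen = np.array([0, 1, 0, 0])   # hypothetical selections; the last is suboptimal
penalty = relative_penalty_percent(times, chosen)
print(penalty.mean(), penalty.max())   # mean and worst-case penalty in percent
```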

Keywords

Decision Tree · Message Passing Interface · Decision Function · Message Size · Performance Penalty



Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Jelena Pješivac-Grbović ¹
  • George Bosilca ¹
  • Graham E. Fagg ¹
  • Thara Angskun ¹
  • Jack J. Dongarra ¹

  1. Innovative Computing Laboratory, Computer Science Department, The University of Tennessee, 1122 Volunteer Blvd., Knoxville, TN 37996-3450, USA
