Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges

Roth, Philip C.; Huck, Kevin; Gopalakrishnan, Ganesh; Wolf, Felix

doi:10.1007/978-3-030-17872-7_16

Philip C. Roth ORCID: orcid.org/0000-0001-9583-1103¹⁹,
Kevin Huck²⁰,
Ganesh Gopalakrishnan²¹ &
…
Felix Wolf²²

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 11027))

Included in the following conference series:

516 Accesses
1 Citations

Abstract

Characterization of a parallel application’s communication patterns can be useful for performance analysis, debugging, and system design. However, obtaining and interpreting a characterization can be difficult. AChax implements an approach that uses search and a library of known communication patterns to automatically characterize communication patterns. Our approach has some limitations that reduce its effectiveness for the patterns and pattern combinations used by some real-world applications. By viewing AChax’s pattern recognition problem as an image recognition problem, it may be possible to use deep learning to address these limitations. In this position paper, we present our current ideas regarding the benefits and challenges of integrating deep learning into AChax and our conclusion that a hybrid approach combining deep learning classification, regression, and the existing AChax approach may be the best long-term solution to the problem of parameterizing recognized communication patterns.

This manuscript has been co-authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Softcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The Garbage pattern is an exception: it only provides a generator method because this pattern’s only purpose is to introduce “noise” into synthetic workloads used in unit testing.

References

Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous distributed systems (2015). http://download.tensorflow.org/paper/whitepaper2015.pdf
Al-Rfou, R., et al.: Theano: a Python framework for fast computation of mathematical expressions. arXiv e-prints abs/1605.02688, May 2016. http://arxiv.org/abs/1605.02688
Graph-tool: efficient network analysis (2018). https://graph-tool.skewed.de
Gropp, W., Lusk, E., Skjellum, A.: Using MPI: Portable Parallel Programming with the Message-passing Interface. Scientific and Engineering Computation, 2nd edn. MIT Press, Cambridge (1999)
Book Google Scholar
NumPy (2018). http://www.numpy.org
Paszke, A., et al.: Automatic differentiation in PyTorch. In: NIPS 2017 Autodiff Workshop, December 2017
Google Scholar
Roth, P.C.: Improved accuracy for automated communication pattern characterization using communication graphs and aggressive search space pruning. In: Bhatele, A., et al. (eds.) ESPT/VPA 2017/2018. LNCS, vol. 11027, pp. 38–55. Springer, Cham (2019)
Google Scholar
Roth, P.C.: Scalable, automated characterization of parallel application communication behavior. In: 2018 Scalable Tools Workshop, July 2018
Google Scholar
Roth, P.C., Meredith, J.S., Vetter, J.S.: Automated characterization of parallel application communication patterns. In: Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing (HPDC 2015), Portland, Oregon, USA, pp. 73–84, August 2015. https://doi.org/10.1145/2749246.2749278

Download references

Acknowledgments

We thank David Poliakoff of Lawrence Livermore National Laboratory for his helpful feedback about this paper and the tools workshop presentation that motivated it.

This material is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research under contract number DE-AC05-00OR22725.

This work is supported in part by the US Department of Energy Office of Science SciDAC RAPIDS project under subcontract 4000159855 to the University of Oregon from Oak Ridge National Laboratory.

Author information

Authors and Affiliations

Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
Philip C. Roth
University of Oregon, Eugene, OR, 97403, USA
Kevin Huck
University of Utah, Salt Lake City, UT, 84112, USA
Ganesh Gopalakrishnan
Technische Universität Darmstadt, 64289, Darmstadt, Germany
Felix Wolf

Authors

Philip C. Roth
View author publications
You can also search for this author in PubMed Google Scholar
Kevin Huck
View author publications
You can also search for this author in PubMed Google Scholar
Ganesh Gopalakrishnan
View author publications
You can also search for this author in PubMed Google Scholar
Felix Wolf
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Philip C. Roth .

Editor information

Editors and Affiliations

Lawrence Livermore National Laboratory, Livermore, CA, USA
Abhinav Bhatele
Lawrence Livermore National Laboratory, Livermore, CA, USA
David Boehme
The University of Arizona, Tucson, AZ, USA
Joshua A. Levine
University of Oregon, Eugene, OR, USA
Allen D. Malony
Technical University of Munich, Munich, Germany
Martin Schulz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Roth, P.C., Huck, K., Gopalakrishnan, G., Wolf, F. (2019). Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges. In: Bhatele, A., Boehme, D., Levine, J., Malony, A., Schulz, M. (eds) Programming and Performance Visualization Tools. ESPT ESPT VPA VPA 2017 2018 2017 2018. Lecture Notes in Computer Science(), vol 11027. Springer, Cham. https://doi.org/10.1007/978-3-030-17872-7_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-17872-7_16
Published: 24 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-17871-0
Online ISBN: 978-3-030-17872-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics