Abstract
Characterization of a parallel application’s communication patterns can be useful for performance analysis, debugging, and system design. However, obtaining and interpreting a characterization can be difficult. AChax implements an approach that uses search and a library of known communication patterns to automatically characterize communication patterns. Our approach has some limitations that reduce its effectiveness for the patterns and pattern combinations used by some real-world applications. By viewing AChax’s pattern recognition problem as an image recognition problem, it may be possible to use deep learning to address these limitations. In this position paper, we present our current ideas regarding the benefits and challenges of integrating deep learning into AChax and our conclusion that a hybrid approach combining deep learning classification, regression, and the existing AChax approach may be the best long-term solution to the problem of parameterizing recognized communication patterns.
This manuscript has been co-authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The Garbage pattern is an exception: it only provides a generator method because this pattern’s only purpose is to introduce “noise” into synthetic workloads used in unit testing.
References
Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous distributed systems (2015). http://download.tensorflow.org/paper/whitepaper2015.pdf
Al-Rfou, R., et al.: Theano: a Python framework for fast computation of mathematical expressions. arXiv e-prints abs/1605.02688, May 2016. http://arxiv.org/abs/1605.02688
Graph-tool: efficient network analysis (2018). https://graph-tool.skewed.de
Gropp, W., Lusk, E., Skjellum, A.: Using MPI: Portable Parallel Programming with the Message-passing Interface. Scientific and Engineering Computation, 2nd edn. MIT Press, Cambridge (1999)
NumPy (2018). http://www.numpy.org
Paszke, A., et al.: Automatic differentiation in PyTorch. In: NIPS 2017 Autodiff Workshop, December 2017
Roth, P.C.: Improved accuracy for automated communication pattern characterization using communication graphs and aggressive search space pruning. In: Bhatele, A., et al. (eds.) ESPT/VPA 2017/2018. LNCS, vol. 11027, pp. 38–55. Springer, Cham (2019)
Roth, P.C.: Scalable, automated characterization of parallel application communication behavior. In: 2018 Scalable Tools Workshop, July 2018
Roth, P.C., Meredith, J.S., Vetter, J.S.: Automated characterization of parallel application communication patterns. In: Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing (HPDC 2015), Portland, Oregon, USA, pp. 73–84, August 2015. https://doi.org/10.1145/2749246.2749278
Acknowledgments
We thank David Poliakoff of Lawrence Livermore National Laboratory for his helpful feedback about this paper and the tools workshop presentation that motivated it.
This material is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research under contract number DE-AC05-00OR22725.
This work is supported in part by the US Department of Energy Office of Science SciDAC RAPIDS project under subcontract 4000159855 to the University of Oregon from Oak Ridge National Laboratory.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Roth, P.C., Huck, K., Gopalakrishnan, G., Wolf, F. (2019). Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges. In: Bhatele, A., Boehme, D., Levine, J., Malony, A., Schulz, M. (eds) Programming and Performance Visualization Tools. ESPT ESPT VPA VPA 2017 2018 2017 2018. Lecture Notes in Computer Science(), vol 11027. Springer, Cham. https://doi.org/10.1007/978-3-030-17872-7_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-17872-7_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-17871-0
Online ISBN: 978-3-030-17872-7
eBook Packages: Computer ScienceComputer Science (R0)