Abstract
This work looks at analyzing I/O traffic of users’ jobs on a HPC machine for a period of time. Monitoring tools are collecting the data in a continuous basis on the HPC system. We looked at aggregate I/O data usage patterns of users’ jobs on the system both on the parallel shared Lustre file system and the node-local SSDs. Data mining tools are then applied to analyze the I/O usage pattern data in an attempt to tie the data to particular codes that produced those I/O behaviors from users’ jobs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Comet User Guide. http://www.sdsc.edu/support/user_guides/comet.html
Hammond, J.: TACC stats: I/O performance monitoring for the instransigent. In: Invited Keynote for the 3rd IASDS Workshop, pp. 1–29 (2011)
Ester, M., Kriegel, H.-P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Simoudis, E., Han, J., Fayyad, U.M. (eds.) Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD-1996), pp. 226–231. AAAI Press (1996)
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. JMLR 12, 2825–2830 (2011)
Kriegel, H.-P., Kroeger, P., Sander, J., Zimek, A.: Density-based clustering. WIREs Data Min. Knowl. Discov. 1(3), 231–240 (2011)
Acknowledgement
Authors acknowledge funding support/sponsorship from Engility Corporation’s High Performance Computing Center of Excellence (HPC CoE) that was used to support the student research. Authors thank Dr. Rajiv Bendale, Engility Corporation, for many valuable suggestions for this project.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Nazrul, S.S. et al. (2019). Analyzing IO Usage Patterns of User Jobs to Improve Overall HPC System Efficiency. In: Majumdar, A., Arora, R. (eds) Software Challenges to Exascale Computing. SCEC 2018. Communications in Computer and Information Science, vol 964. Springer, Singapore. https://doi.org/10.1007/978-981-13-7729-7_7
Download citation
DOI: https://doi.org/10.1007/978-981-13-7729-7_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-7728-0
Online ISBN: 978-981-13-7729-7
eBook Packages: Computer ScienceComputer Science (R0)