Understanding Metadata Latency with MDWorkbench

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 11203)

Abstract

While parallel file systems often satisfy the needs of applications with bulk-synchronous I/O, they lack capabilities for dealing with metadata-intensive workloads. Typically, procurements focus on aggregated metadata throughput as measured with the MDTest benchmark (https://www.vi4io.org/tools/benchmarks/mdtest). However, metadata performance is crucial for interactive use. Metadata benchmarks involve even more parameters than I/O benchmarks, and several aspects are currently not covered and therefore not in the focus of vendors to investigate, particularly response latency and interactive workloads operating on a working set of data. This lack of file system capabilities can be observed in the IO-500 list, where the metadata performance of the best and worst systems does not differ significantly.

In this paper, we introduce a new benchmark called MDWorkbench, which generates a reproducible workload emulating many concurrent users or, in an alternative view, queuing systems. The benchmark provides a detailed latency profile, overcomes caching issues, and offers a method to assess the quality of the observed throughput. We evaluate the benchmark on state-of-the-art parallel file systems with GPFS (IBM Spectrum Scale), Lustre, Cray's DataWarp, and DDN IME, and conclude that we can reveal characteristics that could not be identified before.
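The paper defines the workload precisely; purely as an illustration of the queue-like access pattern sketched above, the following self-contained C program (our own sketch, not MDWorkbench code; the directory name, object size, working-set size, and iteration count are all made up) precreates a working set and then, per iteration, creates the newest object, reads an existing one, and deletes the oldest, timing each operation individually:

    #include <stdio.h>
    #include <stdlib.h>
    #include <string.h>
    #include <time.h>
    #include <unistd.h>
    #include <sys/stat.h>

    #define OBJ_SIZE    4096  /* hypothetical object size, not the paper's value */
    #define WORKING_SET  100  /* objects kept alive at any time */
    #define ITERATIONS  1000

    static double now_s(void) {
        struct timespec ts;
        clock_gettime(CLOCK_MONOTONIC, &ts);
        return ts.tv_sec + ts.tv_nsec * 1e-9;
    }

    static void obj_name(char *buf, size_t len, long id) {
        snprintf(buf, len, "workdir/obj.%ld", id);
    }

    static void create_obj(long id, const char *data) {
        char name[256];
        obj_name(name, sizeof(name), id);
        FILE *f = fopen(name, "wb");
        if (!f) { perror("create"); exit(1); }
        fwrite(data, 1, OBJ_SIZE, f);
        fclose(f);
    }

    int main(void) {
        char data[OBJ_SIZE];
        memset(data, 'x', sizeof(data));
        mkdir("workdir", 0755);  /* ignore EEXIST for brevity */

        /* Precreation phase: build the initial working set. */
        for (long i = 0; i < WORKING_SET; i++)
            create_obj(i, data);

        /* Benchmark phase: the working set stays constant in size, so
         * client caches cannot absorb the whole workload; each operation
         * is timed individually to obtain a latency profile. */
        for (long i = 0; i < ITERATIONS; i++) {
            char name[256];
            char buf[OBJ_SIZE];

            double t0 = now_s();
            create_obj(WORKING_SET + i, data);   /* newest object */
            double t_create = now_s() - t0;

            t0 = now_s();
            obj_name(name, sizeof(name), i + WORKING_SET / 2);  /* a live object */
            FILE *f = fopen(name, "rb");
            if (!f) { perror("read"); exit(1); }
            if (fread(buf, 1, OBJ_SIZE, f) != OBJ_SIZE) { /* tolerate short reads here */ }
            fclose(f);
            double t_read = now_s() - t0;

            t0 = now_s();
            obj_name(name, sizeof(name), i);     /* oldest live object */
            unlink(name);
            double t_delete = now_s() - t0;

            printf("%ld create=%.6f read=%.6f delete=%.6f\n",
                   i, t_create, t_read, t_delete);
        }
        return 0;
    }

MDWorkbench itself runs this pattern with MPI across many concurrent processes, datasets, and storage backends; the sketch only conveys the single-process structure.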


Notes

  1. http://singularity.lbl.gov/.

  2. https://sourceforge.net/projects/fsmark/.

  3. http://www.storageperformance.org/.

  4. https://www.azul.com/files/HowNotToMeasureLatency_LLSummit_NYC_12Nov2013.pdf.

  5. MDWorkbench is available under: https://github.com/JulianKunkel/md-workbench.

  6. A backend like MPI-IO may implement this operation as a NoOp if it is not supported (see the first sketch after these notes).

  7. This value is used in the IO-500 benchmark because it prevents inode stuffing; we choose it for comparability.

  8. The plot is sparse: of the 1 million creates, 100k data points have been randomly selected. Additionally, all measurements above 0.1 s have been included (see the second sketch after these notes).
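To make note 6 concrete, here is a minimal sketch of the idea, assuming a hypothetical function-table backend interface (these names are illustrative, not the actual MDWorkbench API): a backend that cannot support an operation installs a stub that succeeds without side effects, so the benchmark can call every operation unconditionally on every backend.

    #include <stdio.h>

    /* Hypothetical backend interface: a table of operation callbacks. */
    typedef struct {
        const char *name;
        int (*create_obj)(const char *id);
        int (*sync_namespace)(void);   /* not every storage backend supports this */
    } backend_t;

    static int example_create(const char *id) {
        printf("%s created\n", id);    /* stand-in for a real create */
        return 0;
    }

    /* NoOp stub: report success without doing anything. */
    static int noop_sync(void) {
        return 0;
    }

    static const backend_t example_backend = {
        .name           = "example",
        .create_obj     = example_create,
        .sync_namespace = noop_sync,   /* unsupported operation -> NoOp */
    };

    int main(void) {
        const backend_t *b = &example_backend;
        b->create_obj("obj.0");
        b->sync_namespace();
        return 0;
    }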
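Note 8's sparsification can likewise be sketched in a few lines. The 1 million / 100k / 0.1 s figures come from the note; the data source below is a stand-in, and the Bernoulli draw only approximates selecting exactly 100k points (an exact selection would shuffle indices or use reservoir sampling):

    #include <stdio.h>
    #include <stdlib.h>

    #define N         1000000   /* total measurements (1 million creates) */
    #define TARGET     100000   /* approximate random sample size */
    #define THRESHOLD     0.1   /* seconds; slow outliers are always kept */

    /* Stand-in for reading the i-th recorded latency from a measurement log. */
    static double latency_of(long i) {
        (void)i;
        return (double)rand() / RAND_MAX * 0.15;
    }

    int main(void) {
        srand(42);  /* fixed seed for a reproducible selection */
        long kept = 0;
        for (long i = 0; i < N; i++) {
            double lat = latency_of(i);
            int keep = (lat > THRESHOLD) ||                     /* always keep outliers */
                       ((double)rand() / RAND_MAX < (double)TARGET / N);  /* subsample rest */
            if (keep) {
                kept++;  /* a real pipeline would emit (i, lat) as a plot point */
            }
        }
        printf("kept %ld of %d measurements\n", kept, N);
        return 0;
    }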


Acknowledgements

Thanks to DDN for providing access to their facility, and to Jean-Thomas Acquaviva and Jay Lofstead for the discussions. This research used resources of the KAUST Supercomputing Core Laboratory, the Argonne Leadership Computing Facility, and NERSC; the latter two are DOE Office of Science User Facilities supported under Contracts DE-AC02-06CH11357 and DE-AC02-05CH11231, respectively.

Author information

Correspondence to Julian Martin Kunkel.



Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Kunkel, J.M., Markomanolis, G.S. (2018). Understanding Metadata Latency with MDWorkbench. In: Yokota, R., Weiland, M., Shalf, J., Alam, S. (eds) High Performance Computing. ISC High Performance 2018. Lecture Notes in Computer Science, vol 11203. Springer, Cham. https://doi.org/10.1007/978-3-030-02465-9_5

  • DOI: https://doi.org/10.1007/978-3-030-02465-9_5

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-02464-2

  • Online ISBN: 978-3-030-02465-9

  • eBook Packages: Computer Science (R0)
