Abstract
Typical applications in data science consume, process, and produce large amounts of data, making disk I/O one of the dominant factors of their overall performance and thus a worthwhile optimization target. Distributed processing frameworks, such as Hadoop, Flink, and Spark, hide a lot of complexity from the programmer when they parallelize these applications across a compute cluster. This complicates reasoning about the I/O of both the application and the framework, through the distributed file system, such as HDFS, down to the local file systems.
We present SFS (Statistics File System), a modular framework to trace each I/O request issued by the application and any JVM-based big data framework involved, mapping these requests to actual disk I/O.
This allows the detection of inefficient I/O patterns in both the applications and the underlying frameworks, and forms the basis for improving I/O scheduling in the big data software stack.
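The kind of per-request interposition described above can be illustrated with a minimal sketch: a wrapper stream that records how many read calls an application issues and how many bytes it actually consumes. This is not the SFS implementation itself, only an assumed, simplified example of how a JVM-level tracing layer can sit between application code and the underlying stream; the class name `CountingInputStream` is hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.concurrent.atomic.AtomicLong;

// Illustrative sketch (not the actual SFS code): an InputStream wrapper
// that counts read() invocations and bytes read, similar in spirit to a
// tracing layer interposed on JVM file I/O.
public class CountingInputStream extends InputStream {
    private final InputStream in;
    private final AtomicLong readCalls = new AtomicLong();
    private final AtomicLong bytesRead = new AtomicLong();

    public CountingInputStream(InputStream in) {
        this.in = in;
    }

    @Override
    public int read() throws IOException {
        int b = in.read();
        readCalls.incrementAndGet();
        if (b != -1) {
            bytesRead.incrementAndGet();
        }
        return b;
    }

    @Override
    public int read(byte[] buf, int off, int len) throws IOException {
        int n = in.read(buf, off, len);
        readCalls.incrementAndGet();
        if (n > 0) {
            bytesRead.addAndGet(n);
        }
        return n;
    }

    public long getReadCalls() { return readCalls.get(); }
    public long getBytesRead() { return bytesRead.get(); }

    public static void main(String[] args) throws IOException {
        // Simulate an application draining a 1 KiB source in 256-byte chunks.
        CountingInputStream cis =
            new CountingInputStream(new ByteArrayInputStream(new byte[1024]));
        byte[] buf = new byte[256];
        while (cis.read(buf, 0, buf.length) != -1) {
            // consume
        }
        System.out.println(cis.getReadCalls() + " calls, "
            + cis.getBytesRead() + " bytes");
    }
}
```

In a real tracing setup, such wrappers would additionally record timestamps and map each request to the file and process involved, so that application-level reads can later be correlated with actual disk I/O.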
Notes
2. We assume NTP-synchronized clocks for JVMs/CPUs and nodes.
3. We successfully tested SFS on Oracle Java 1.8.0_45-b14 as well.
Acknowledgments
This work received funding from the BMBF projects Berlin Big Data Center (BBDC) under grant 01IS14013B and GeoMultiSens under grant 01IS14010C.
Copyright information
© 2018 Springer Nature Switzerland AG
Cite this paper
Schmidtke, R., Schintke, F., Schütt, T. (2018). From Application to Disk: Tracing I/O Through the Big Data Stack. In: Yokota, R., Weiland, M., Shalf, J., Alam, S. (eds) High Performance Computing. ISC High Performance 2018. Lecture Notes in Computer Science(), vol 11203. Springer, Cham. https://doi.org/10.1007/978-3-030-02465-9_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02464-2
Online ISBN: 978-3-030-02465-9