Skip to main content

Mediating Data Center Storage Diversity in HPC Applications with FAODEL

  • Conference paper
  • First Online:
Book cover High Performance Computing (ISC High Performance 2019)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11887))

Included in the following conference series:

  • 5863 Accesses

Abstract

Composition of computational science applications into both ad hoc pipelines for analysis of collected or generated data and into well-defined and repeatable workflows is becoming increasingly popular. Meanwhile, dedicated high performance computing storage environments are rapidly becoming more diverse, with both significant amounts of non-volatile memory storage and mature parallel file systems available. At the same time, computational science codes are being coupled to data analysis tools which are not filesystem-oriented. In this paper, we describe how the FAODEL data management service can expose different available data storage options and mediate among them in both application- and FAODEL-directed ways. These capabilities allow applications to exploit their knowledge of the different types of data they may exchange during a workflow execution, and also provide FAODEL with mechanisms to proactively tune data storage behavior when appropriate. We describe the implementation of these capabilities in FAODEL and how they are used by applications, and present preliminary performance results demonstrating the potential benefits of our approach.

SAND2019-6668C.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. The OpenACC application programming interface, November 2018. http://openacc-standard.org

  2. Adjie-Winoto, W., Schwartz, E., Balakrishnan, H., Lilley, J.: The design and implementation of an intentional naming system. In: Proceedings of the Seventeenth ACM Symposium on Operating Systems Principles, SOSP 1999, pp. 186–201. ACM, New York (1999). https://doi.org/10.1145/319151.319164

  3. Ayachit, U., et al.: The SENSEI generic in situ interface. In: Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization (ISAV), pp. 40–44. IEEE (2016)

    Google Scholar 

  4. Bauer, M., Treichler, S., Slaughter, E., Aiken, A.: Legion: expressing locality and independence with logical regions. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, p. 66. IEEE Computer Society Press (2012)

    Google Scholar 

  5. Bustamante, F., Widener, P., Schwan, K.: Scalable directory services using proactivity. In: Proceedings 2002 ACM/IEEE Conference on Supercomputing. ACM/IEEE, Baltimore, November 2002

    Google Scholar 

  6. Dong, B., et al.: Data elevator: low-contention data movement in hierarchical storage system. In: 2016 IEEE 23rd International Conference on High Performance Computing (HiPC), pp. 152–161. IEEE (2016)

    Google Scholar 

  7. Edwards, H.C., Trott, C.R., Sunderland, D.: Kokkos: enabling manycore performance portability through polymorphic memory access patterns. J. Parallel Distrib. Comput. 74(12), 3202–3216 (2014). https://doi.org/10.1016/j.jpdc.2014.07.003. http://www.sciencedirect.com/science/article/pii/S0743731514001257. Domain-Specific Languages and High-Level Frameworks for High-Performance Computing

    Article  Google Scholar 

  8. The Apache Software Foundation: Apache cassandra (2018). https://cassandra.apache.org/. Accessed 10 May 2018

  9. The Apache Software Foundation: Apache spark - unified analytics engine for big data (2018). https://spark.apache.org/. Accessed 10 May 2018

  10. Germain, J.D.d.S., McCorquodale, J., Parker, S.G., Johnson, C.R.: Uintah: a massively parallel problem solving environment. In: 2000 Proceedings the Ninth International Symposium on High-Performance Distributed Computing, pp. 33–41. IEEE (2000)

    Google Scholar 

  11. Ghemawat, S., Dean, J.: LevelDB, a fast and lightweight key/value database library by Google (2014)

    Google Scholar 

  12. google: Github - google/leveldb: Leveldb is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values (2018). https://github.com/google/leveldb. Accessed 10 May 2018

  13. Jones, T., et al.: Unity: unified memory and file space. In: Proceedings of the 7th International Workshop on Runtime and Operating Systems for Supercomputers (ROSS 2017), p. 6. ACM (2017)

    Google Scholar 

  14. Kale, L.V., Krishnan, S.: Charm++: a portable concurrent object oriented system based on C++. ACM SIGPLAN Not. 28, 91–108 (1993)

    Article  Google Scholar 

  15. Lakshman, A., Malik, P.: Cassandra: a decentralized structured storage system. SIGOPS Oper. Syst. Rev. 44(2), 35–40 (2010). https://doi.org/10.1145/1773912.1773922

    Article  Google Scholar 

  16. Moody, A., Bronevetsky, G., Mohror, K., de Supinski, B.R.: Design, modeling, and evaluation of a scalable multi-level checkpointing system. In: Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–11. IEEE Computer Society (2010)

    Google Scholar 

  17. Pavlo, A., et al.: A comparison of approaches to large-scale data analysis. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, pp. 165–178. ACM (2009)

    Google Scholar 

  18. Pébaÿ, P., et al.: Towards asynchronous many-task in situ data analysis using legion. In: 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, pp. 1033–1037. IEEE (2016)

    Google Scholar 

  19. Ulmer, C., et al.: Faodel: data management for next-generation application workflows. In: Proceedings of the 9th Workshop on Scientific Cloud Computing, p. 8. ACM (2018)

    Google Scholar 

  20. Ulmer, C., et al.: Faodel: data management for next-generation application workflows. In: Proceedings 9th Workshop on Scientific Cloud Computing, Science Cloud 2018. ACM, June 2018

    Google Scholar 

  21. Vahdat, A., Dahlin, M., Anderson, T., Aggarwal, A.: Active names: flexible location and transport of wide-area resources. In: Proceedings USENIX Symposium on Internet Technology and Systems, October 1999

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Patrick Widener .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Widener, P., Ulmer, C., Levy, S., Kordenbrock, T., Templet, G. (2019). Mediating Data Center Storage Diversity in HPC Applications with FAODEL. In: Weiland, M., Juckeland, G., Alam, S., Jagode, H. (eds) High Performance Computing. ISC High Performance 2019. Lecture Notes in Computer Science(), vol 11887. Springer, Cham. https://doi.org/10.1007/978-3-030-34356-9_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-34356-9_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-34355-2

  • Online ISBN: 978-3-030-34356-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics