Advertisement

Efficient Implementation of Data Objects in the OSD+-Based Fusion Parallel File System

  • Juan Piernas
  • Pilar González-Férez
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10659)

Abstract

OSD+s are enhanced object-based storage devices (OSDs) able to deal with both data and metadata operations via data and directory objects, respectively. So far, we have focused on designing and implementing efficient directory objects in OSD+s. This paper, however, presents our work on also supporting data objects, and describes how the coexistence of both kinds of objects in OSD+s is profited to efficiently implement data objects and to speed up some common file operations. We compare our OSD+-based Fusion Parallel File System (FPFS) with Lustre and OrangeFS. Results show that FPFS provides a performance up to \(37{\times }\) better than Lustre, and up to \(95{\times }\) better than OrangeFS, for metadata workloads. FPFS also provides 34% more bandwidth than OrangeFS for data workloads, and competes with Lustre for data writes. Results also show serious scalability problems in Lustre and OrangeFS.

Keywords

FPFS OSD+ Data objects Lustre OrangeFS 

Notes

Acknowledgements

Work supported by the Spanish MEC, and European Commission FEDER funds, under grants TIN2012-38341-C04-03 and TIN2015-66972-C5-3-R.

References

  1. 1.
    Avilés-González, A., Piernas, J., González-Férez, P.: Scalable metadata management through OSD+ devices. Int. J. Parallel Program. 42(1), 4–29 (2014)CrossRefGoogle Scholar
  2. 2.
    Avilés-González, A., Piernas, J., González-Férez, P.: Batching operations to improve the performance of a distributed metadata service. J. Supercomput. 72(2), 654–687 (2016)CrossRefGoogle Scholar
  3. 3.
    Bent, J., Gibson, G., Grider, G., McClelland, B., Nowoczynski, P., Nunez, J., Polte, M., Wingate, M.: PLFS: a checkpoint filesystem for parallel applications. In: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (SC 2009), pp. 1–12 (2009)Google Scholar
  4. 4.
    Brandt, S.A., Miller, E.L., Long, D.D.E., Xue., L.: Efficient metadata management in large distributed storage systems. In: Proceedings of the 20th IEEE Conference on Mass Storage Systems and Technologies (MSST 2003), pp. 290–298 (2003)Google Scholar
  5. 5.
    Cray Inc.: HPCS-IO, October 2012. http://sourceforge.net/projects/hpcs-io
  6. 6.
    Dilger, A.: Lustre metadata scaling, April 2012. http://storageconference.us/2012/Presentations/T01.Dilger.pdf. Tutorial at the 28th IEEE Conference on Massive Data Storage (MSST 2012)
  7. 7.
    OpenSFS, EOFS: The Lustre file system, December 2016. http://www.lustre.org
  8. 8.
    Patil, S., Ren, K., Gibson, G.: A case for scaling HPC metadata performance through de-specialization. In: Proceedings of 7th Petascale Data Storage Workshop Supercomputing (PDSW 2012), pp. 1–6, November 2012Google Scholar
  9. 9.
    Skeen, D., Stonebraker, M.: A formal model of crash recovery in a distributed system. IEEE Trans. Softw. Eng. 9(3), 219–228 (1983)CrossRefGoogle Scholar
  10. 10.
    The PVFS Community: The Orange file system, October 2016. http://orangefs.org
  11. 11.
    Weil, S.A., Brandt, S.A., Miller, E.L., Long, D.D.E., Maltzahn, C.: Ceph: a scalable, high-performance distributed file system. In: Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2006), pp. 307–320 (2006)Google Scholar
  12. 12.
    Wheeler, R.: One billion files: scalability limits in Linux file systems. In: LinuxCon 2010, August 2010. http://events.linuxfoundation.org/slides/2010/linuxcon2010_wheeler.pdf

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Departamento de Ingeniería y Tecnología de ComputadoresUniversidad de MurciaMurciaSpain

Personalised recommendations