On Evaluating Decentralized Parallel I/O Scheduling Strategies for Parallel File Systems

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4395)

Abstract

This paper evaluates the impact of the parallel I/O scheduling strategy on file access performance in Clusterfile, a parallel file system for clusters of commodity computers. We argue that the parallel I/O scheduling strategy should be seen as a complement to other file access optimizations such as striping over several I/O servers, non-contiguous I/O, and collective I/O. Our study is based on three simple decentralized parallel I/O heuristics implemented inside Clusterfile. Measurements in a real environment show that the performance of parallel file access may vary by as much as 86% for writing and 804% for reading, depending on the heuristic employed and on the schedule block granularity.

Editor information

Michel Daydé, José M. L. M. Palma, Álvaro L. G. A. Coutinho, Esther Pacitti, João Correia Lopes

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Isailă, F., Singh, D., Carretero, J., Garcia, F. (2007). On Evaluating Decentralized Parallel I/O Scheduling Strategies for Parallel File Systems. In: Daydé, M., Palma, J.M.L.M., Coutinho, Á.L.G.A., Pacitti, E., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2006. VECPAR 2006. Lecture Notes in Computer Science, vol 4395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71351-7_10

  • DOI: https://doi.org/10.1007/978-3-540-71351-7_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71350-0

  • Online ISBN: 978-3-540-71351-7

  • eBook Packages: Computer Science, Computer Science (R0)
