On Evaluating Decentralized Parallel I/O Scheduling Strategies for Parallel File Systems

Isailă, Florin; Singh, David; Carretero, Jesús; Garcia, Félix

doi:10.1007/978-3-540-71351-7_10


Conference paper


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4395)

Abstract

This paper evaluates the impact of the parallel I/O scheduling strategy on file access performance in Clusterfile, a parallel file system for clusters of commodity computers. We argue that the parallel I/O scheduling strategy should be seen as a complement to other file access optimizations such as striping over several I/O servers, non-contiguous I/O, and collective I/O. Our study is based on three simple decentralized parallel I/O heuristics implemented inside Clusterfile. Measurements in a real environment show that the performance of parallel file access may vary by as much as 86% for writing and 804% for reading, depending on the heuristic employed and on the schedule block granularity.



Author information

Authors and Affiliations

Department of Computer Science, University Carlos III de Madrid, Spain
Florin Isailă, David Singh, Jesús Carretero & Félix Garcia


Editor information

Michel Daydé, José M. L. M. Palma, Álvaro L. G. A. Coutinho, Esther Pacitti, João Correia Lopes


About this paper

Cite this paper

Isailă, F., Singh, D., Carretero, J., Garcia, F. (2007). On Evaluating Decentralized Parallel I/O Scheduling Strategies for Parallel File Systems. In: Daydé, M., Palma, J.M.L.M., Coutinho, Á.L.G.A., Pacitti, E., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2006. VECPAR 2006. Lecture Notes in Computer Science, vol 4395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71351-7_10


DOI: https://doi.org/10.1007/978-3-540-71351-7_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71350-0
Online ISBN: 978-3-540-71351-7
eBook Packages: Computer Science, Computer Science (R0)
