A New I/O Architecture for Improving the Performance in Large Scale Clusters

  • Conference paper
Computational Science and Its Applications - ICCSA 2006 (ICCSA 2006)

Abstract

Supercomputers and high performance computing clusters have advanced tremendously over the past few years. Clusters are currently the most common solution for high performance computing. In this kind of system, the design of the parallel I/O subsystem is an important issue. Parallel file systems (GPFS, PVFS, Lustre, etc.) have been the usual way to obtain high performance I/O: they increase performance by distributing file data across several I/O nodes. However, cluster sizes are growing continuously, especially in the number of compute nodes, which turns the I/O nodes into a possible bottleneck of the system.
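The idea of distributing a file across several I/O nodes can be illustrated with a small sketch. It assumes simple round-robin striping with a fixed stripe unit; the function names `stripe` and `reassemble`, and the in-memory lists standing in for I/O nodes, are hypothetical and do not reflect the layout policy of any particular file system named above.

```python
def stripe(data: bytes, num_io_nodes: int, stripe_size: int) -> list[list[bytes]]:
    """Split data into fixed-size stripe units and deal them round-robin to I/O nodes."""
    nodes: list[list[bytes]] = [[] for _ in range(num_io_nodes)]
    for offset in range(0, len(data), stripe_size):
        unit_index = offset // stripe_size
        # Unit j lands on node j mod N (assumed round-robin layout).
        nodes[unit_index % num_io_nodes].append(data[offset:offset + stripe_size])
    return nodes


def reassemble(nodes: list[list[bytes]]) -> bytes:
    """Rebuild the original byte stream by reading one unit per node in turn."""
    units = []
    rounds = max(len(n) for n in nodes)
    for k in range(rounds):
        for node in nodes:
            if k < len(node):
                units.append(node[k])
    return b"".join(units)


if __name__ == "__main__":
    layout = stripe(bytes(range(10)), num_io_nodes=3, stripe_size=2)
    for i, units in enumerate(layout):
        print(f"I/O node {i}: {units}")
```

Because consecutive stripe units live on different nodes, a large sequential read or write can be served by all I/O nodes at once, which is where the performance gain comes from.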

In this paper, we propose a new architecture that addresses this problem: a hierarchical I/O architecture based on parallel I/O proxies. These I/O proxies execute on the compute nodes and offer an intermediate parallel file system between the applications and the storage system of the cluster. This architecture reduces the load on the I/O nodes, increasing global performance. The paper presents the design of the proposed solution and a preliminary evaluation on a cluster located at the Stuttgart HLRS center.
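The abstract does not say how the proxies relieve the I/O nodes. One plausible reading, assumed here purely for illustration, is that each I/O node then serves a small number of proxies instead of every compute node. The toy model below counts the clients each I/O node has to serve in both configurations. The function names, the modulo assignment of compute nodes to proxies, and the assumption that every client contacts every I/O node are all hypothetical and not taken from the paper.

```python
from collections import Counter


def direct_fan_in(num_compute: int, num_io: int) -> Counter:
    """Clients per I/O node when every compute node contacts every I/O node."""
    clients: Counter = Counter()
    for _compute in range(num_compute):
        for io_node in range(num_io):
            clients[io_node] += 1
    return clients


def proxied_fan_in(num_compute: int, num_proxies: int, num_io: int) -> Counter:
    """Clients per I/O node when only proxies contact the I/O nodes.

    Compute node c is assigned to proxy c mod num_proxies (assumed policy).
    """
    active_proxies = {c % num_proxies for c in range(num_compute)}
    clients: Counter = Counter()
    for _proxy in active_proxies:
        for io_node in range(num_io):
            clients[io_node] += 1
    return clients


if __name__ == "__main__":
    print("direct :", dict(direct_fan_in(64, 4)))
    print("proxied:", dict(proxied_fan_in(64, 8, 4)))
```

Under these assumptions, the number of clients per I/O node depends on the number of proxies rather than on the number of compute nodes, so it stays bounded as the cluster grows.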

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

García, L.M.S., Isaila, F.D., Carballeira, F.G., Pérez, J.C., Rabenseifner, R., Adamidis, P. (2006). A New I/O Architecture for Improving the Performance in Large Scale Clusters. In: Gavrilova, M.L., et al. Computational Science and Its Applications - ICCSA 2006. ICCSA 2006. Lecture Notes in Computer Science, vol 3984. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11751649_12

  • DOI: https://doi.org/10.1007/11751649_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-34079-9

  • Online ISBN: 978-3-540-34080-5

  • eBook Packages: Computer Science, Computer Science (R0)