Abstract
The technology advances made in supercomputers and high performance computing clusters over the past few years have been tremendous. Clusters are the most common solution for high performance computing at the present time. In this kind of systems, an important subject is the parallel I/O subsystem design. Parallel file systems (GPFS, PVFS, Lustre, etc) have been the solution used to obtain high performance I/O. Parallel file systems increase performance by distributing data file across several I/O nodes. However, cluster’s size is increasing continuously, specially for compute nodes, becoming the I/O nodes in a possible bottleneck of the system.
In this paper, we propose a new architecture that solves the problem pointed out before: new hierarchical I/O architecture based on parallel I/O proxies. Those I/O proxies execute on the compute nodes offering an intermediate parallel file system between the applications and the storage system of the cluster. That architecture reduces the load on the I/O nodes increasing the global performance. This paper shows the design of the proposed solution and a preliminary evaluation, using a cluster located in the Stuttgart HLRS center.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vaughan-Nichols, S.J.: New trends revive supercomputing industry. 15(2), 10–13 (2004)
Fryxell, B., Olson, K., Ricker, P., Timmes, F.X., Zingale, M., Lamb, D.Q., Mac- Neice, P., Rosner, R., Truran, J.W., Tufo, H.: FLASH: An Adaptive Mesh Hydrodynamics Code for Modeling Astrophysical Thermonuclear Flashes, vol. 131, pp. 273–334 (2000)
del Rosario, J.M., Bordawekar, R., Choudhary, A.: Improved parallel i/o via a two-phase run-time access strategy. SIGARCH Comput. Archit. News 21(5), 31–38 (1993)
Salem, K., Garcia-Molina, H.: Disk striping. In: Proceedings of the Second International Conference on Data Engineering, California, USA, February 5-7, pp. 336–342. IEEE Computer Society, Los Alamitos (1986)
Patterson, D.A., Gibson, G.A., Katz, R.H.: A case for redundant arrays of inexpensive disks (raid). In: Boral, H., Larson, P.Å. (eds.) Proceedings of the 1988 ACM SIGMOD International Conference on Management of Data, Chicago, Illinois, June 1-3, pp. 109–116. ACM Press, New York (1988)
Schikuta, E., Wanek, H.: Parallel I/O, vol. 15, pp. 162–168 (2001)
MPI-Forum: Mpi-2: Extensions to the message-passing interface (1997)
Carns, P.H., Ligon III, W.B., Ross, R.B., Thakur, R.: PVFS: A parallel file system for linux clusters. In: Proceedings of the 4th Annual Linux Showcase and Conference, Atlanta, GA, USENIX Association, pp. 317–327 (2000)
Garcia-Carballeira, F., Calderon, A., Carretero, J., Fernandez, J., Perez, J.M.: The design of the Expand parallel file system. The International Journal of High Performance Computing Applications 17(1), 21–38 (2003)
Schmuck, F., Haskin, R.: GPFS: A shared-disk file system for large computing clusters. In: Proc. of the First Conference on File and Storage Technologies (FAST), pp. 231–244 (2002)
Oldfield, R., Kotz, D.: Armada: A parallel file system for computational grids. In: CCGRID 2001: Proceedings of the 1st International Symposium on Cluster Computing and the Grid, Washington, DC, USA, p. 194. IEEE Computer Society, Los Alamitos (2001)
Bent, J., Thain, D., Arpaci-Dusseau, A., Arpaci-Dusseau, R.: Explicit control in a batch-aware distributed file system (2004)
Nowoczynski, P., Stone, N., Sommerfield, J., Gill, B., Scott, J.R.: Slash – the scalable lightweight archival storage hierarchy. In: MSST, pp. 245–252 (2005)
Teaff, D., Watson, D., Coyne, B.: The architecture of the high performance storage system (hpss). In: Goddard Conference on Mass Storage and Technologies (1995)
Corbett, P.F., Baylor, S.J., Feitelson, D.G.: Overview of the vesta parallel file system. SIGARCH Comput. Archit. News 21(5), 7–14 (1993)
Calderón, A., Garcıa, F., Carretero, J., Pérez, J.M., Fernández, J.: An implementation of mpi-io on expand: A parallel file system based on nfs servers. In: Proceedings of the 9th European PVM/MPI Users’ Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface, London, UK, pp. 306–313. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
García, L.M.S., Isaila, F.D., Carballeira, F.G., Pérez, J.C., Rabenseifner, R., Adamidis, P. (2006). A New I/O Architecture for Improving the Performance in Large Scale Clusters. In: Gavrilova, M.L., et al. Computational Science and Its Applications - ICCSA 2006. ICCSA 2006. Lecture Notes in Computer Science, vol 3984. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11751649_12
Download citation
DOI: https://doi.org/10.1007/11751649_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34079-9
Online ISBN: 978-3-540-34080-5
eBook Packages: Computer ScienceComputer Science (R0)