An Operating System support to low-overhead communications in NOW clusters

Marenzoni, P.; Rimassa, G.; Vignali, M.; Bertozzi, M.; Conte, G.; Rossi, P.

doi:10.1007/3-540-62573-9_10

P. Marenzoni^1,2,
G. Rimassa^1,2,
M. Vignali^1,2,
M. Bertozzi^1,2,
G. Conte¹ &
…
P. Rossi²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1199))

Included in the following conference series:

International Workshop on Communication, Architecture, and Applications for Network-Based Parallel Computing

104 Accesses
11 Citations

Abstract

This paper describes an Operating System approach to the problem of delivering low latency high bandwidth communications for PC clusters running a public domain OS like Linux and connected by standard, off-the-shelf networks like Fast-Ethernet. The PARMA ² project has the main goal of designing the new light-weight protocol suite PRP, in order to drastically reduce the software overhead introduced by TCP/IP. PRP wants to offer at high level a stream socket oriented interface and at low level compatibility with any device driver. High level compatibility is crucial in facilitating the porting on PRP of existing applications or message passing packages. Moreover, an optimized version of MPI, based on PRP and evolution of the widespread MPICH implementation, is under development, allowing for a very effective reduction of the communication latencies in synchronous communications, compared to the TCP/IP-based MPI.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

G. Bell: 1995 Observations on Supercomputing Alternatives: Did the MPP Bandwagon Lead to a Cul-de-Sac ? Communications of the ACM Vol 30 No. 3 (1996)
Google Scholar
L. Colombet and L. Desbat: Speedup and efficiency of large size applications on heterogeneous networks. Proc. EURO-PAR96 (1996)
Google Scholar
C. C. Lim and J. P. Ang: Experience on Optimization and Parallelization of Existing Scientific Applications on Network of Workstations. Proc. PDPTA96 (1996)
Google Scholar
R. van Drunen, C. van Teylingen and M. Kroontje: The Amfisbaena: A Parallel Supercomputer System Based on i860 as a Generic Platform for Molecular Dynamics Simulations. Proc. PDPTA96 (1996)
Google Scholar
M.A.Blumrich, K.Li, R.Alpert, C.Dubnicki, E.W.Felten, and J.Sandberg: Virtual Memory Mapped Network Interface for the SHRIMP Multicomputer. Proc. “International Symposium on Computer Architecture” ISCA94 (1994) 142–153.
Google Scholar
T. Sterling, D. Savarese, B. Fryxell, K. Olson, and D. J. Becker: Communication Overhead for Space Science Applications on the Beowulf Parallel Workstation. Proc. “High Performance Distributed Computing” HPDC95 Pentagon City Virginia USA (1995)
Google Scholar
H. Lu, S. Dwarkadas, A. L. Cox and W. Zwaenepoel: Message Passing Versus Distributed Shared Memory on Networks of Workstations. Proc. Supercomputing95 (1995)
Google Scholar
G. Chiola and G. Ciaccio: GAMMA: a Low-cost Network of Workstations Based on Active Messages. Proc. “5th EUROMICRO workshop on Parallel and Distributed Processing PDP'97” London UK (1997)
Google Scholar
W. R. Stevens: Unix Network Programming. Prentice Hall New Jersey (1990)
Google Scholar
D. E. Comer and D. L. Stevens: Internetworking with TCP/IP. Prentice Hall New Jersey (1991)
Google Scholar
Z. Xu and K. Hwang: Modeling Communication Overhead: MPI and MPL Performance on the IBM SP2. IEEE Parallel & Distributed Technology Vol. 4 No. 1 (1996) 25–42
Google Scholar
R. W. Hockney: The Communication Challenge for MPP: Intel Paragon and Meiko CS-2. Parallel Computing Vol. 6 No. 3 (1994) 389–398
Google Scholar
J. J. Dongarra and T. Dunigan: Message-Passing Performance of Various Computers. Tec. Report ORNL/TM-13006 Oak Ridge National Laboratory (1996)
Google Scholar
P. Marenzoni: Performance Analysis of Cray T3D and Connection Machine CM-5: a Comparison. Proc. Int. Conf. “High-Performance Computing and Networking HPCN95” Milan Italy Springer-Verlag LNCS 919 (1995) 110–117
Google Scholar
P. Marenzoni and P. Rossi, Benchmark Kernels as a Tool for Performance Evaluation of MPP's, Concurrency Practice and Experience, 1997, in press, John Wiley & Sons.
Google Scholar
G. Destri and P. Marenzoni: Cellular Neural Networks as a General Massively Parallel Computational Paradigm. Special Issue on Cellular Neural Networks of “International Journal of Circuits Theory and Application” Vol. 24 No. 3 (1996) 397–408
Google Scholar
R. M. Butler and E. L. Lusk: Monitors, Messages, and Clusters: The p4 Parallel Programming System. Parallel Computing Vol. 20 (1994) 547–564
Google Scholar
W. Gropp and E. L. Lusk: MPICH Working Note: Creating a New MPICH Device Using the Channel Interface. Tec. Report Argonne National Laboratory
Google Scholar
L.O. Chua and T. Roska: The CNN Paradigm. IEEE Trans. on Circuit and Systems — I Vol. 40 (1993) 147–155
Google Scholar
L.O. Chua and L. Yang: Cellular Neural Network: Theory. IEEE Trans. on Circuit and Systems Vol. 35 (1988) 1257–1272
Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Ingegneria dell'Informazione, Università di Parma, Viale delle Scienze, I-43100, Parma, Italy
P. Marenzoni, G. Rimassa, M. Vignali, M. Bertozzi & G. Conte
SMART S.r.l., Via dell'Artigianato 31/2, I-40057, Granarolo Emilia, BO, Italy
P. Marenzoni, G. Rimassa, M. Vignali, M. Bertozzi & P. Rossi

Authors

P. Marenzoni
View author publications
You can also search for this author in PubMed Google Scholar
G. Rimassa
View author publications
You can also search for this author in PubMed Google Scholar
M. Vignali
View author publications
You can also search for this author in PubMed Google Scholar
M. Bertozzi
View author publications
You can also search for this author in PubMed Google Scholar
G. Conte
View author publications
You can also search for this author in PubMed Google Scholar
P. Rossi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Dhabaleswar K. Panda Craig B. Stunkel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Marenzoni, P., Rimassa, G., Vignali, M., Bertozzi, M., Conte, G., Rossi, P. (1997). An Operating System support to low-overhead communications in NOW clusters. In: Panda, D.K., Stunkel, C.B. (eds) Communication and Architectural Support for Network-Based Parallel Computing. CANPC 1997. Lecture Notes in Computer Science, vol 1199. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-62573-9_10

Download citation

DOI: https://doi.org/10.1007/3-540-62573-9_10
Published: 03 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62573-5
Online ISBN: 978-3-540-68085-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics