Performance Evaluation of Group Communication Architectures in Large Scale Systems Using MPI

  • Conference paper
On the Move to Meaningful Internet Systems 2006: CoopIS, DOA, GADA, and ODBASE (OTM 2006)

Abstract

Group communication is an important paradigm for fault tolerance in large-scale systems. We describe four group architectures (pipelined, hierarchical, daisy, and hypercube), each consisting of separate clusters, investigate their theoretical performance bounds, and evaluate their experimental performance using MPI group communication primitives. We first derive time bounds for multicast message delivery in these architectures and then present tests that measure the time taken for the same operation. Multicast message delivery times are measured against the number of clusters within a group and the size of the multicast message. We conclude that the daisy architecture is favorable in terms of both delivery times and message sizes, theoretically and experimentally.

An erratum to this chapter can be found at http://dx.doi.org/10.1007/11914952_55.

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Erciyes, K., Dagdeviren, O., Payli, R.Ü. (2006). Performance Evaluation of Group Communication Architectures in Large Scale Systems Using MPI. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems 2006: CoopIS, DOA, GADA, and ODBASE. OTM 2006. Lecture Notes in Computer Science, vol 4276. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11914952_28

  • DOI: https://doi.org/10.1007/11914952_28

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-48274-1

  • Online ISBN: 978-3-540-48283-3

  • eBook Packages: Computer Science, Computer Science (R0)
