
A Peer-to-Peer Framework for Robust Execution of Message Passing Parallel Programs on Grids

  • Stéphane Genaud
  • Choopan Rattanapoka
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3666)

Abstract

This paper presents P2P-MPI, a middleware aimed at computational grids. From the programmer's point of view, P2P-MPI provides a message-passing programming model that enables the development of MPI applications for grids. Its originality lies in its adaptation to unstable environments. First, the peer-to-peer design of P2P-MPI allows for dynamic discovery of collaborating resources. Second, it lets the user adjust the robustness of an execution through an internal process replication mechanism. Finally, we measure the middleware's performance on two NAS benchmarks.

Keywords

Grid, Middleware, Peer-to-peer, MPI, Java

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Stéphane Genaud (1)
  • Choopan Rattanapoka (1)
  1. ICPS-LSIIT – UMR CNRS-ULP 7005, Université Louis Pasteur, Strasbourg
