Advertisement

ClusterGrind: Valgrinding LAM/MPI Applications

  • Brett Carson
  • Ian A. Mason
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3666)

Abstract

Debugging distributed applications using message passing libraries can be extremely difficult. We have implemented a set of tools collectively called ClusterGrind which interface to a GNU licensed debugger, valgrind, to ease the debugging process. By generating useful, customisable reports, we believe the time spent debugging large distributed Linux applications can be reduced significantly. Profiling the running programs is also possible to find coding inefficiencies, to aid in improving the overall application performance.

Keywords

valgrind debugging profiling message passing reporting 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Brett Carson, B., Mason, I.A.: ClusterGrind (2005), http://mcs.une.edu.au/~bcarson/ClusterGrind/
  2. 2.
    Nethercote, N., Seward, J.: Valgrind: A Program Supervision Framework. Electronic Notes in Theoretical Computer Science 89 (2003)Google Scholar
  3. 3.
    Seward, J.: The Valgrind Homepage (2005), http://valgrind.org/
  4. 4.
    Burns, G., Daoud, R., Vaigl, J.: LAM: An Open Cluster Environment for MPI. In: Proceedings of Supercomputing Symposium, pp. 379–386 (1994)Google Scholar
  5. 5.
    Squyres, J.M., Lumsdaine, A.: A Component Architecture for LAM/MPI. In: Proceedings of the 10th European PVM/MPI Users’ Group Meeting, pp. 379–387 (2003)Google Scholar
  6. 6.
    Carson, B., Murison, R., Mason, I.A.: Computational Gains Using RPVM on a Beowulf Cluster. R News 3, 21–26 (2003)Google Scholar
  7. 7.
    Carson, B., Murison, R., Mason, I.A.: Estimating Breeding Values on a Beowulf Cluster. Technical Report, School of Mathematics, Statistics and Computer Science, University of New England (2005)Google Scholar
  8. 8.
    Etnus LLC: The Totalview Debugger (2005), http://www.etnus.com
  9. 9.
    Allinea Software: Distributed Debugging Tool (2005), http://www.absoft.com/
  10. 10.
    Open Systems Lab: LAM/MPI User’s Guide (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Brett Carson
    • 1
  • Ian A. Mason
    • 1
  1. 1.School of Mathematics, Statistics and Computer ScienceUniversity of New England 

Personalised recommendations