Skip to main content

Application and Middleware Transparent Checkpointing with TCKPT on Clustergrid

  • Conference paper
  • 473 Accesses

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   89.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. I. Foster, C. Kesselman, S. Tuecke, “The Anatomy of the Grid. Enabling Scalable Virtual Organizations”, Intern. Journal of Supercomputer Applications, 15(3), 2001

    Google Scholar 

  2. Elnozahy E N, Johnson D B, Wang Y M. “A Survey of Rollback Recovery Protocols in Message-Passing System.” Technical Report. Pittsburgh, PA: CMU-CS-96-181. Carnegie Mellon University, Oct 1996

    Google Scholar 

  3. K.M. Chandy and L. Lamport. „Distributed snapshots: Determining global states of distributed systems”, ACM Transactions on Computer Systems, 3(1):63-75, February 1985.

    Article  Google Scholar 

  4. G. Stellner, “Consistent Checkpoints of PVM Applications”, In Proc. 1st Euro. PVM Users Group Meeting, 1994

    Google Scholar 

  5. M. Litzkow, T. Tannenbaum, J. Basney, and M. Livny, “Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System”, Technical Report #1346, Computer Sciences Department, University of Wisconsin, April 1997

    Google Scholar 

  6. J. Léon, A. L. Fisher, and P. Steenkiste, “Fail-safe PVM: a portable package for distributed programming with transparent recovery”. CMU-CS-93-124. February, 1993

    Google Scholar 

  7. G.D. van Albada; J. Clinckemaillie; A.H.L. Emmen; J. Gehring; O. Heinz; F. van der Linden; B.J. Overeinder; A. Reinefeld and P.M.A. Sloot: „Dynamite - blasting obstacles to parallel cluster computing”, in P.M.A. Sloot; M. Bubak; A.G. Hoekstra and L.O. Hertzberger, editors, High-Performance Computing and Networking (HPCN Europe '99), Amsterdam, The Netherlands, in series Lecture Notes in Computer Science, nr 1593 pp. 300-310. Springer-Verlag, Berlin, April 1999. ISBN 3-540-65821-1.

    Google Scholar 

  8. Jozsef Kovacs: “Making PVM applications checkpointable for the Grid” Proc. of the Microcad 2005 Conference, Section N, pp. 223-228, Marcius 10-11, 2005, Miskolc

    Google Scholar 

  9. http://checkpointing.psnc.pl/Progress/psncLibCkpt

  10. Gracjan Jankowski, Rafal Mikolajczak, Radoslaw Januszewski: “Checkpoint/Restart mechanism for multiprocess applications implemented under SGIGrid Project”, Proceedings of the Cracow GridWorkshop 2004, pp.142 149, ISBN: 83-911541-4-5, 2005.

    Google Scholar 

  11. G. Jankowski, R. Januszewski, R. Mikolajczak, J. Kovacs: "Scalable multilevel checkpointing for distributed applications - on the integration possibility of TCKPT and psncLibCkpt ", CoreGRID Technical Report, TR-0019, March 2006

    Google Scholar 

  12. G. Jankowski, R. Januszewski, R. Mikolajczak, J. Kovacs: "Scalable multilevel checkpointing for distributed applications - on the possibility of integrating Total Checkpoint and AltixC/R", CoreGRID Technical Report, TR-0035, March 2006

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Science+Business Media, LLC

About this paper

Cite this paper

Kovács, J., Mikolajczak, R., Januszewski, R., Jankowski, G. (2007). Application and Middleware Transparent Checkpointing with TCKPT on Clustergrid. In: Kacsuk, P., Fahringer, T., Németh, Z. (eds) Distributed and Parallel Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-69858-8_18

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-69858-8_18

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-69857-1

  • Online ISBN: 978-0-387-69858-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics