Skip to main content

Modelling and Verification of a Checkpoint-Restart-Protocol

  • Conference paper
Fehlertolerierende Rechensysteme

Part of the book series: Informatik-Fachberichte ((INFORMATIK,volume 84))

Abstract

The paper presents a protocol for the treatment of temporary local crashes and communication failures in systems of communicating processes. Assuming the existence of a global partitioning of the processes’ interactions into a sequence of segments as well as the availability of a reliable memory to each participant, the recovery strategy lies in the provident permanent recording and discarding of checkpoint data and, in cases of failure, common restarts from some recent global checkpoint (boundary between two segments). Apart from the inherent limitations imposed by data flow and resources the processes may run totally asynchronously. The protocol is modelled by a PrT-net, which permits to formally state and verify the protocol’s service, namely guaranteeing liveness and the eventual progress of the system’s actions. The final part of this paper is devoted to the structure and basic ideas of this proof.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baumgarten, B., and Ochsenschläger, P. On Termination and Phase Changes in the Presence of Unreliable Communication Arbeitspapier der GMD, St. Augustin (1984)

    Google Scholar 

  2. Baumgarten, B., and Ochsenschläger, P. Modellierung und Verifikation eines Checkpoint-/Restart-Verfahrens Arbeitspapier der GMD, St. Augustin (1984)

    Google Scholar 

  3. Genrich, H. J., and Lautenbach, K. System Modelling with High-Level Petri Nets Theoretical Computer Science 13, 109–136 (1981)

    MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1984 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Baumgarten, B., Ochsenschläger, P. (1984). Modelling and Verification of a Checkpoint-Restart-Protocol. In: Großpietsch, KE., Dal Cin, M. (eds) Fehlertolerierende Rechensysteme. Informatik-Fachberichte, vol 84. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-69698-5_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-69698-5_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-13348-3

  • Online ISBN: 978-3-642-69698-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics