Detecting distributed termination in the presence of node failure

  • Concurrency and Networking
  • Conference paper
Algorithms, Concurrency and Knowledge (ACSC 1995)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1023)

Abstract

Distributed termination detection is the problem of detecting that a distributed computation, spread across a set of processing nodes, has terminated. Many solutions have been proposed, but most do not address the failure of a processing node. This paper proposes a termination detection scheme that tolerates node failure. The scheme is both symmetric and distributed, and it can be applied to any connection topology. It uses the FIFO property of a single transmission line to cope with system-wide non-FIFO message transmission. Some of the termination detection information is backed up for fault tolerance. The backing up does not require the processing nodes to synchronise, and it does not interfere with the normal computation, so the overhead of achieving fault tolerance is minimised. The scheme is derived using the invariant technique.
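
To make the problem statement concrete, the following is a minimal sketch of the global condition that any termination detection scheme has to establish: every node is passive and no basic message is still in transit. It is not the scheme proposed in the paper. The names `NodeState` and `is_terminated` are hypothetical, the check assumes a consistent global snapshot is already available (which a real distributed algorithm cannot take for granted), and node failure is not modelled at all.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class NodeState:
    """Hypothetical per-node record used only for this illustration."""
    active: bool    # True while the node is still computing
    sent: int       # number of basic messages this node has sent
    received: int   # number of basic messages this node has received


def is_terminated(nodes: Iterable[NodeState]) -> bool:
    """Evaluate the termination condition on a consistent snapshot.

    The computation has terminated when all nodes are passive and the
    total number of messages sent equals the total number received,
    i.e. nothing is in transit that could reactivate a passive node.
    """
    nodes = list(nodes)
    all_passive = all(not n.active for n in nodes)
    in_transit = sum(n.sent for n in nodes) - sum(n.received for n in nodes)
    return all_passive and in_transit == 0


# Both nodes passive, 3 messages sent and 3 received in total.
done = is_terminated([NodeState(False, 2, 1), NodeState(False, 1, 2)])

# Both nodes passive, but one message is still in transit.
pending = is_terminated([NodeState(False, 2, 1), NodeState(False, 1, 1)])
```

The second case shows why observing only passive nodes is not enough: a message still in transit can reactivate its receiver. Establishing the condition without a global view, and doing so when nodes can fail, is the difficulty the abstract refers to.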

Editor information

Kanchana Kanchanasut, Jean-Jacques Lévy

Copyright information

© 1995 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ye, X., Keane, J. (1995). Detecting distributed termination in the presence of node failure. In: Kanchanasut, K., Lévy, JJ. (eds) Algorithms, Concurrency and Knowledge. ACSC 1995. Lecture Notes in Computer Science, vol 1023. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60688-2_45

  • DOI: https://doi.org/10.1007/3-540-60688-2_45

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-60688-8

  • Online ISBN: 978-3-540-49262-7

  • eBook Packages: Springer Book Archive
