Abstract
Distributed termination detection is the problem of detecting that a distributed computation spread across a set of processing nodes has terminated. Many solutions have been proposed, but most do not address the failure of processing nodes. This paper proposes a termination detection scheme that tolerates node failure. The scheme is both symmetric and distributed, and it can be applied to any connection topology. It uses the FIFO property of a single transmission line to cope with system-wide non-FIFO message transmission. Some of the termination detection information is backed up for fault tolerance. The backup does not require synchronisation of the processing nodes and does not interfere with the normal computation, so the overhead of achieving fault tolerance is minimised. The scheme is derived using the invariant technique.
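As background only, the condition that any termination detector must establish can be sketched as follows: every node is passive and no basic message is still in transit. This is not the scheme proposed in the paper, which the abstract does not spell out. The sketch ignores node failure and assumes a consistent global snapshot, which a real distributed detector cannot obtain for free. The names `Node`, `send`, `deliver` and `is_terminated` are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Node:
    active: bool = False  # True while the node is still computing
    sent: int = 0         # basic messages sent so far
    received: int = 0     # basic messages received so far


def send(nodes, src, channel, dst):
    """Node `src` sends one basic message addressed to node `dst`."""
    nodes[src].sent += 1
    channel.append(dst)


def deliver(nodes, channel, index):
    """Deliver the message at `index`; any index is allowed, so the
    system-wide delivery order need not be FIFO."""
    dst = channel.pop(index)
    nodes[dst].received += 1
    nodes[dst].active = True  # a message reactivates a passive node


def is_terminated(nodes):
    """Termination predicate on a consistent snapshot (an assumption):
    all nodes passive and sent == received, i.e. nothing in transit."""
    all_passive = all(not n.active for n in nodes)
    in_transit = sum(n.sent for n in nodes) - sum(n.received for n in nodes)
    return all_passive and in_transit == 0


nodes = [Node(active=True), Node(), Node()]
channel = []
send(nodes, 0, channel, 1)
send(nodes, 0, channel, 2)
nodes[0].active = False
print(is_terminated(nodes))  # False: two messages are still in transit

deliver(nodes, channel, 1)   # the second message arrives first
nodes[2].active = False
deliver(nodes, channel, 0)
nodes[1].active = False
print(is_terminated(nodes))  # True: all passive, nothing in transit
```

The first check shows why "all nodes passive" alone is not enough: node 0 has stopped, nodes 1 and 2 have not yet started, but messages in transit can still reactivate them.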
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ye, X., Keane, J. (1995). Detecting distributed termination in the presence of node failure. In: Kanchanasut, K., Lévy, JJ. (eds) Algorithms, Concurrency and Knowledge. ACSC 1995. Lecture Notes in Computer Science, vol 1023. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60688-2_45
DOI: https://doi.org/10.1007/3-540-60688-2_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60688-8
Online ISBN: 978-3-540-49262-7
eBook Packages: Springer Book Archive