Tracing Fault Tolerance

Schepers, Henk

doi:10.1007/978-3-7091-4009-3_4

Henk Schepers⁴

Part of the book series: Dependable Computing and Fault-Tolerant Systems ((DEPENDABLECOMP,volume 8))

60 Accesses
7 Citations

Abstract

A fault may cause a process to behave abnormally, and a fault hypothesis divides such abnormal behaviour into exceptional and catastrophic behaviours. The set of normal and exceptional behaviours can be considered the set of acceptable behaviours. In this report traces, or communication histories, are used to denote the behaviour of a process. The semantic function ℋ[P] defines the set of possible communication sequences that can be observed up to any point in an execution of process P. A fault hypothesis is defined as a predicate representing a reflexive relation between the normal and acceptable histories of a process. Such relations enable one to abstract from the precise nature of a fault and to focus on the exceptional behaviour it causes. For a fault hypothesis χ the construct (P≀χ) indicates execution of process P under the assumption of χ. Then, the set ℋ[(P≀χ)] is the set of acceptable histories of P with respect to χ.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

K. A. Bartlett, R. A. Scantlebury, P. T. Wilkinson. A note on reliable full-duplex transmission over half-duplex links. Communications of the ACM, Vol. 12, No. 5, 1969, pp. 260–261.
Article Google Scholar
F. Cristian. A rigorous approach to fault-tolerant programming. IEEE Transaction on Software Engineering, Vol. SE-11, No. 1, pp. 23–31, 1985.
Article Google Scholar
T. A. Henzinger, Z. Manna, A. Pnueli. Timed transition systems. Lecture Notes in Computer Science, Vol. 600, Springer-Verlag, 1992, pp. 226–251.
Article MathSciNet Google Scholar
C. A. R. Hoare. Communicating Sequential Processes. Prentice-Hall International, 1985.
Google Scholar
M. Joseph, A. Moitra, N. Soundararajan. Proof rules for fault tolerant distributed programs. Science of Computer Programming, Vol. 8, 1987, pp. 43–67.
Article MathSciNet MATH Google Scholar
L. Lamport. Time, clocks, and the ordering of events in a distributed system. Communications of the ACM, Vol. 21, No. 7, 1978, pp. 558–565.
Article MATH Google Scholar
J. C. Laprie. Dependable computing and fault tolerance: concepts and terminology. Proc. 15th IEEE Int. Symp. on Fault Tolerant Computing, Ann Arbor, Mich., 1985, pp. 2-11.
Google Scholar
P. A Lee, T. Anderson. Fault tolerance: principles and practice. Springer-Verlag, 1990.
Google Scholar
J. Peleska. Design and verification of fault tolerant systems with CSP. Distributed Computing, Vol. 5, 1991, pp. 95–106.
Article MATH Google Scholar
H. Schepers. Terminology and paradigms for fault tolerance. Report CSN 91-08, Eindhoven University of Technology, 1991. Also to appear in: J. Vytopil (ed.). Formal Techniques in Real-Time and Fault Tolerant Systems. Kluwer Academic Publishers, 1993.
Google Scholar
R. D. Schlichting, F. B. Schneider. Fail-stop processors: an approach to designing fault tolerant computing systems. ACM Transaction on Computer Systems, Vol. 1, No. 3, 1983, pp. 222–238.
Article Google Scholar
F. B. Schneider. Implementing fault tolerant services using the state machine approach: a tutorial. ACM Computing Surveys, Vol. 22, No. 4, 1990, pp. 299–319.
Article Google Scholar
D. G. Weber. Formal specification of fault-tolerance and its relation to computer security. ACM Software Engineering Notes, Vol. 14, No. 3, 1989, pp. 273–277.
Article Google Scholar
J. Zwiers. Compositionality, concurrency and partial correctness. Lecture Notes in Computer Science, Vol. 321, Springer-Verlag, 1989.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Computing Science, Eindhoven University of Technology, P.O. Box 513, 5600 MB, Eindhoven, The Netherlands
Henk Schepers

Authors

Henk Schepers
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Naval Research Laboratory, Washington DC, 20375-5000, USA
Carl E. Landwehr
Dept. of Computer Sc., Univ. of Newcastle, Newcastle upon Tyne, NE1 7RU, UK
Brian Randell
Dip. Ingegneria dell’Informazione, Università di Pisa, I-56100, Pisa, Italia
Luca Simoncini

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schepers, H. (1993). Tracing Fault Tolerance. In: Landwehr, C.E., Randell, B., Simoncini, L. (eds) Dependable Computing for Critical Applications 3. Dependable Computing and Fault-Tolerant Systems, vol 8. Springer, Vienna. https://doi.org/10.1007/978-3-7091-4009-3_4

Download citation

DOI: https://doi.org/10.1007/978-3-7091-4009-3_4
Publisher Name: Springer, Vienna
Print ISBN: 978-3-7091-4011-6
Online ISBN: 978-3-7091-4009-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics