Abstract
The design of fault-tolerant systems should ideally be based on rigorous predictions of the effects of design decisions on the achieved dependability. However, the complexity of the task is such that these decisions are typically based on ingrained, time-proven practice, without the benefit of thorough mathematical analysis. We analyse two specific problems in fault-tolerant design based on modular replication (with or without design diversity). First, we consider adjudication, i.e., the derivation of a single correct result from the multiple results produced by the replicas in a redundant component. Many designs have been proposed in the literature, supposed to improve upon simple majority voting, but without a unified, rigorous analysis to assist design choices. We describe such a general method for evaluating and comparing adjudicators, in probabilistic terms, and specify an optimal adjudicator, which yields the highest possible reliability for a redundant component, given the (probabilistic) failure characteristics of its subcomponents. Our analysis applies to components with and without a fail-safe mode. Second, we consider fault treatment: how the decision can be made to remove a replica of a component, considering it permanently failed, on the basis of its history of agreement/disagreement with other replicas. The problem is compounded by transient faults, which make it undesirable to disconnect a component at the first signs of errors, and by the use of dynamic error processing, in which the number of replicas executed depends on whether disagreements are observed. For this problem, we choose a scheme integrating dynamic error processing with diagnosis and disconnection of components that may be permanently failed, and show how its behaviour can be compared with alternative designs via simulation.
This paper is a compendium of [44] and [26].
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1995 ECSC — EC — EAEC, Brussels — Luxembourg
About this paper
Cite this paper
Bondavalli, A., Chiaradonna, S., Di Giandomenico, F., Strigini, L. (1995). Rational Design of Multiple-Redundant Systems: Adjudication and Fault Treatment. In: Randell, B., Laprie, JC., Kopetz, H., Littlewood, B. (eds) Predictably Dependable Computing Systems. ESPRIT Basic Research Series. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-79789-7_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-79789-7_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-79791-0
Online ISBN: 978-3-642-79789-7
eBook Packages: Springer Book Archive