Abstract
This paper discusses the modeling and analysis of three major fault-tolerant software system architectures: DRB (Distributed Recovery Blocks), NVP (N-Version Programming) and NSCP (N Self-Checking Programming). In the system-level reliability modeling domain, fault tree analysis techniques and Markov modeling techniques are combined to incorporate transient and permanent hardware faults as well as unrelated and related software faults. These models are parameterized by a real-world fault-tolerant flight control computer application for evaluations and comparisons. In particular, a series of sensitivity analysis is performed to explore the critical components in each fault-tolerant architecture and display their quantitative impacts to the overall system reliability.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Algirdas AviŽienis. The N-version approach to fault-tolerant software. IEEE Transactions on Software Engineering, SE-11(12):1491–1501, December 1985.
P.G. Bishop, D.G. Esp, M. Barnes, P. Humphreys, G. Dahl, and J. Lahti. PODS — a project of diverse software. IEEE Transactions on Software Engineering, SE-12(9):929–940, September 1986.
D. Briere and P. Traverse. Airbus A320/A330/A340 electrical flight controls: A family of fault-tolerant systems. In Proceedings of the 23rd Symposium on Fault Tolerant Computing, pages 616-623, 1993.
Joanne Bechta Dugan and K. S. Trivedi. Coverage modeling for dependability analysis of fault-tolerant systems. IEEE Transactions on Computers, 38(6):775–787, 1989.
Robert Geist and Kishor Trivedi. Reliability estimation of fault-tolerant systems: Tools and techniques. IEEE Computer, pages 52-61, July 1990.
Gunnar Hagelin. ERICSSON safety system for railway control. In U. Voges, editor, Software Diversity in Computerized Control Systems, pages 11-21. Springer-Verlag, 1988.
A. D. Hills. Digital fly-by-wire experience. In Proceedings AGARD Lecture Series, number 143, October 1985.
Allen M. Johnson and Miroslaw Malek. Survey of software tools for evaluating reliability availability, and serviceability. ACM Computing Surveys, 20(4):227–269, December 1988.
K.H. Kim and Howard O. Welch. Distributed execution of recovery blocks: An approach for uniform treatment of hardware and software faults in real-time applications. IEEE Transactions on Computers, 38(5):626–636, May 1989.
Jaynarayan H. Lala and Linda S. Alger. Hardware and software fault tolerance: A unified architectural approach. In Proc. IEEE Int. Symp. on Fault-Tolerant Computing, FTCS-18, pages 240-245, June 1988.
Jean-Claude Laprie. Dependability evaluation of software systems in operation. IEEE Transactions on Software Engineering, SE-10(6):701–714, November 1984.
Jean-Claude Laprie, Jean Arlat, Christian Beounes, and Karama Kanoun. Definition and Analysis of Hardware-and Software-Fault-Tolerant Architectures. IEEE Computer, pages 39-51, July 1990.
Jean-Claude Laprie and Karama Kanoun. X-ware reliability and availability modeling. IEEE Transactions on Software Engineering, pages 130-147, February, 1992.
Michael R. Lyu and Yu-Tao He. Improving the N-version programming process through the evolution of a design paradigm. IEEE Transactions on Reliability, June 1993.
C. V. Ramamoorthy, Y. Mok, F. Bastani, G. Chin, and K. Suzuki. Application of a methodology for the development and validation of reliable process control software. IEEE Transactions on Software Engineering, SE-7(6):537–555, November 1981.
Brian Randell. System structure for software fault tolerance. IEEE Transactions on Software Engineering, SE-l(2):220–232, June 1975.
R. Sahner and K. S. Trivedi. Reliability modeling using SHARPE. IEEE Transactions on Reliability, R-36(2): 186–193, June 1987.
R. Keith Scott, James W. Gault, and David F. McAllister. Fault-tolerantsoftware reliability modeling. IEEE Transactions on Software Engineering, SE-13(5):582–592, May 1987.
Kang G. Shin and Yann-Hang Lee. Evaluation of error recovery blocks used for cooperating processes. IEEE Transactions on Software Engineering, SE-10(6):692–700, November 1984.
George. E. Stark. Dependability evaluation of integrated hardware/software systems. IEEE Transactions on Reliability, pages 440-444, October 1987.
Pascal Traverse. Airbus and ATR system architecture and specification. In U. Voges, editor, Software Diversity in Computerized Control Systems, pages 95-104. Springer-Verlag, June 1986.
Udo Voges. Use of diversity in experimental reactor safety systems. In U. Voges, editor, Software Diversity in Computerized Control Systems, pages 29-49. Springer-Verlag, 1988.
L. J. Yount. Architectural solutions to safety problems of digital flight-critical systems for commercial transports. In Proceedings AIAA/IEEE Digital Avionics Systems Conference, pages 1-8, December 1984.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1995 Springer-Verlag/Wien
About this paper
Cite this paper
Dugan, J.B., Lyu, M.R. (1995). System-Level Reliability and Sensitivity Analyses for Three Fault-Tolerant System Architectures. In: Cristian, F., Le Lann, G., Lunt, T. (eds) Dependable Computing for Critical Applications 4. Dependable Computing and Fault-Tolerant Systems, vol 9. Springer, Vienna. https://doi.org/10.1007/978-3-7091-9396-9_37
Download citation
DOI: https://doi.org/10.1007/978-3-7091-9396-9_37
Publisher Name: Springer, Vienna
Print ISBN: 978-3-7091-9398-3
Online ISBN: 978-3-7091-9396-9
eBook Packages: Springer Book Archive