A Numerical Technique for the Hierarchical Evaluation of Large, Closed Fault-Tolerant Systems
This paper describes a novel approach for evaluating the reliability of large fault-tolerant systems. The design hierarchy of the system is preserved during the evaluation, allowing large systems to be analyzed. Semi-Markov models are used at each level in the hierarchy, and a numerical technique is used to combine models from a given level for use at the next level. Different values of parameters, such as coverage, can then be used appropriately at any level, resulting in a much more accurate prediction of reliability. The proposed technique has been validated through comparison with analytical calculations, results from existing tools and Monte-Carlo simulation.
KeywordsTime Slice Design Level Memory Module Large State Space Subsystem Model
Unable to display preview. Download preview PDF.
- W. Carter and J. Abraham, “Design and evaluation tools for fault-tolerant systems,” in Proceedings of the AIAA Computers in Aerospace VI Conference, pp. 70-77, October 1987.Google Scholar
- R. Geist and K. Trivedi, “Reliability estimation of fault-tolerant systems: Tools and techniques,” Computer, pp. 52-61, July 1990.Google Scholar
- W. G. Bouricius, W. C. Carter, and P. R. Schneider, “Reliability modeling techniques for self-repairing computer systems,” in Proceedings of the 12th ACM National Conference, pp. 295-309, August 1969.Google Scholar
- J. Dugan, K. Trivedi, M. Smotherman, and R. Geist, “The hybrid automated reliability predictor,” AIAA Journal of Guidance, Control, and Dynamics, pp. 319-331, May–June 1986.Google Scholar
- R. Sahner and K. Trivedi, “A hierarchial combinatorial-markov method for solving complex reliability models,” in Proceedings ACM/IEEE Fall Joint Computing Conference, November 1986.Google Scholar
- A. Goyal, W. Carter, E. de Souza e Silva, S. Lavenberg, and K. Trivedi, “The system availability estimator,” in Proceedings of the 16th IEEE Fault-Tolerant Computing Symposium, pp. 84-89, July 1986.Google Scholar
- S. Bavuso, J. Brunelle, and P. Peterson, “Care iii hands-on demonstration and tutorial,” Technical Memorandum 85811, NASA, May 1984.Google Scholar
- S. Bavuso, P. Peterson, and D. Rose, “Care iii model overview and user’s guide,” Technical Memorandum 85810, NASA, June 1984.Google Scholar
- S. Makam and A. Avizienis, “Aries 81: A reliability and life-cycle evaluation tool for fault-tolerant systems,” in Proceedings of the IEEE 12th Fault-Tolerant Computing Symposium, pp. 267-274, June 1982.Google Scholar
- K. Trivedi and R. Geist, “A tutorial on the care iii approach to reliability modeling,” Contractor Report 3488, NASA, December 1981.Google Scholar
- HARP: The Hybrid Automated Reliability Predictor Introduction and User’ s Guide.Google Scholar