Summary
This paper deals with fast simulation techniques for estimating transient measures in highly dependable systems. The systems we consider consist of components with generally distributed lifetimes and repair times, with complex interaction among components. As is well known, standard simulation of highly dependable systems is very inefficient and importance sampling is widely used to improve efficiency. We present two new techniques, one of which is based on the uniformization approach to simulation, and the other is a natural extension of the uniformization approach which we call exponential transformation. We show that under certain assumptions, these techniques have the bounded relative error property, i.e., the relative error of the simulation estimate remains bounded as components become more and more reliable, unlike standard simulation in which it tends to infinity. This implies that only a fixed number of observations are required to achieve a given relative error, no matter how rare the failure events are.
This paper was originally published in ACM Transactions on Modeling and Computer Simulation 4, 137–164 (1994). ©1994, Association for Computing Machinery, Inc. (ACM). Reprinted with permission.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barlow, R.E., Proschan, F.:Statistical Theory of Reliability and Life Testing. New York: Holt, Reinhart and Winston 1981
Brown, M.: Error Bounds for Exponential Approximations of Geometric Convolutions. The Annals of Probability 18, 1388–1402 (1990)
Carrasco, J. A.: Failure Distance-Based Simulation of Repairable Fault-Tolerant Systems. Proceedings of the Fifth International Conference on Modelling Techniques and Tools for Computer Performance Evaluation (1991a), pp. 337–351
Carrasco, J.A.: Efficient Transient Simulation of Failure/Repair Markovian Models. Proceedings of the Tenth Symposium on Reliable and Distributed Computing. IEEE Press 1991b, pp. 152–161
Dugan, J.B., Trivedi, K.S., Smotherman M.K., Geist, R.M.: The Hybrid Automated Reliability Predictor. Journal of Guidance, Control and Dynamics 9, 319–331 (1986)
Fox, B.L., Glynn, P.W.: Discrete-Time Conversion for Simulating Finite-Horizon Markov Processes. SIAM J. Appl. Math. 50, 1457–1473 (1990)
Frater, M.R., Lennon, T.M., Anderson, B.D.O.: Optimally Efficient Estimation of the Statistics of Rare Events in Queueing Networks. IEEE Transactions on Automatic Control 36, 1395–1405 (1991)
Geist, R.M., Smotherman, M.K.: Ultrahigh Reliability Estimates through Simulation. Proceedings of the Annual Reliability and Maintainability Symposium. IEEE Press 1989, pp. 350–355
Geist, R.M., Trivedi, K.S.: Ultra-High Reliability Prediction for Fault-Tolerant Computer Systems. IEEE Transactions on Computers C-32, 1118–1127 (1983)
Gertsbakh, I.B.: Asymptotic Methods in Reliability Theory: A Review. Advances in Applied Probability 16, 147–175 (1984)
Glynn, P.W.: A GSMP Formalism for Discrete Event Systems. Proceedings of the IEEE 77, 14–23 (1989)
Glynn, P.W.: Importance Sampling for Markov Chains: Asymptotics for the Variance. Technical Report. Dept. of Operations Research, Stanford University (1992)
Glynn, P.W., Heidelberger, P., Nicola, V.F., and Shahabuddin, P.: Efficient Estimation of the Mean Time Between Failures in Non-Regenerative Dependability Models. IBM Research Report RC 19080. Yorktown Heights, New York (1993)
Glynn, P.W., Iglehart, D.L.: Importance Sampling for Stochastic Simulations. Management Science 35, 1367–1392 (1989)
Goyal, A., Lavenberg, S.S.: Modeling and Analysis of Computer System Availability. IBM Journal of Research and Development 31, 651–664 (1987)
Goyal, A., Shahabuddin, P., Heidelberger, P., Nicola, V.F., Glynn, P.W.: A Unified Framework for Simulating Markovian Models of Highly Reliable Systems. IEEE Transactions on Computers C-41, 36–51 (1992)
Gross, D., Miller, D.R.: The Randomization Technique as a Modeling Tool and Solution Procedure for Transient Markov Processes. Operations Research 32, 343–361 (1984)
Hammersley, J.M., Handscomb, D.C.: Monte Carlo Methods. London: Methuen 1964
Heidelberger, P.: Fast Simulation of Rare Events in Queueing and Reliability Models. ACM Transactions on Modeling and Computer Simulation 5, 43–85 (1995)
Heidelberger, P., Nicola, V.F., Shahabuddin, P.: Simultaneous and Efficient Simulation of Highly Dependable Systems with Different Underlying Distributions. Proceedings of the 1992 Winter Simulation Conference. IEEE Press 1992, pp. 458–465
Jensen, A.: Markov Chains as an Aid in the Study of Markov Processes. Skand. Aktuarietidskr. 36, 87–91 (1953)
Juneja, S., Shahabuddin, P.: Fast Simulation of Markovian Reliability/Availability Models with General Repair Policies. Proceedings of the Twenty-Second International Symposium on Fault-Tolerant Computing. IEEE Computer Society Press 1992, pp. 150–159
Keilson, J.: Markov Chain Models - Rarity and Exponentiality. New York: Springer 1979
Lewis, E.E., Bohm, F.: Monte Carlo Simulation of Markov Unreliability Models. Nuclear Engineering and Design 77, 49–62 (1984)
Lewis, P.A.W., Shedler, G.S.: Simulation of Nonhomogeneous Poisson Processes by Thinning. Naval Research Logistics Quarterly 26, 403–413 (1979)
Moorsel, A.P.A. van, Haverkort, B.R., Niemegeers, I.G.:. Fault Injection Simulation: A Variance Reduction Technique for Systems with Rare Events. Dependable Computing for Critical Applications 2. Berlin: Springer 1991, pp. 115–134
Nakayama, M.K.: A Characterization of the Simple Failure Biasing Method for Simulations of Highly Reliable Markovian Systems. ACM Transactions on Modeling and Computer Simulation 4, 52–88 (1994)
Nakayama, M.K.: General Conditions for Bounded Relative Error in Simulations of Highly Reliable Markovian Systems. IBM Research Report RC 18993. Yorktown Heights, New York (1993)
Nakayama, M.K.: Simulation of Highly Reliable Markovian and Non-Markovian Systems. Ph.D. Dissertation, Department of Operations Research, Stanford University (1991)
Nicola, V.F., Heidelberger, P., Shahabuddin, P.: Uniformization and Exponential Transformation: Techniques for Fast Simulation of Highly Dependable NonMarkovian Systems. Proceedings of the Twenty-Second International Symposium on Fault-Tolerant Computing. IEEE Computer Society Press 1992, pp. 130–139
Nicola, V.F., Nakayama, M.K., Heidelberger, P., Goyal, A.: Fast Simulation of Dependability Models with General Failure, Repair and Maintenance Processes. Proceedings of the Twentieth International Symposium on Fault-Tolerant Computing. IEEE Computer Society Press 1991, pp. 491–498
Nicola, V.F., Shahabuddin, P., Heidelberger, P.: Techniques for Fast Simulation of Highly Dependable Systems. Proceedings of the Second International Workshop on Performability Modelling of Computer and Communication Systems (1993)
Nicola, V.F., Shahabuddin, P., Heidelberger, P., Glynn, P.W: Fast Simulation of Steady-State Availability in Non-Markovian Highly Dependable Systems. Proceedings of the Twenty-Third International Symposium on Fault-Tolerant Computing. IEEE Computer Society Press 1992, pp. 38–47
Parekh, S., Walrand, J: A Quick Simulation Method for Excessive Backlogs in Networks of Queues. IEEE Transactions on Automatic Control 34, 54–56 (1989)
Sadowsky, J.S.: Large Deviations and Efficient Simulation of Excessive Backlogs in a GI/G/m Queue. IEEE Transactions on Automatic Control 36, 1383–1394 (1991)
Shahabuddin, P.: Simulation and Analysis of Highly Reliable Systems. Ph.D. Dissertation, Department of Operations Research, Stanford University (1990)
Shahabuddin, P.: Importance Sampling for the Simulation of Highly Reliable Markovian Systems. Management Science 40, 333–352 (1994a)
Shahabuddin, P.: Fast Transient Simulation of Markovian Models of Highly Dependable Systems. Performance Evaluation 20, 267–286 (1994b)
Shahabuddin, P., Nakayama, M. K.: Estimation of Reliability and its Derivatives for Large Time Horizons in Markovian Systems. Proceedings of 1993 Winter Simulation Conference. IEEE Press 1993, pp. 422–429
Shanthikumar, J.G.: Uniformization and Hybrid Simulation/Analytic Models of Renewal Processes. Operations Research 34, 573–580 (1986)
Stuffier, J., Bryant, L.: CARE III Phase III Report-Mathematical Description. NASA Contractor Report 3566 (1982)
Van Dijk, N.M.: On a Simple Proof of Uniformization for Continuous and Discrete-State Continuous-Time Markov Chains. Adv. Appl. Prob. 22, 749–750 (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Heidelberger, P., Shahabuddin, P., Nicola, V.F. (1996). Bounded Relative Error in Estimating Transient Measures of Highly Dependable Non-Markovian Systems. In: Özekici, S. (eds) Reliability and Maintenance of Complex Systems. NATO ASI Series, vol 154. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-03274-9_25
Download citation
DOI: https://doi.org/10.1007/978-3-662-03274-9_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-08250-4
Online ISBN: 978-3-662-03274-9
eBook Packages: Springer Book Archive