Abstract
A task with ideal execution time \(\ell \) is handled by a Markovian system with features similar to the ones in classical reliability. The Markov states are of two types, UP and DOWN, such that the task only can be processed in an UP state. Upon entrance to a DOWN state, processing is stopped and must be restarted from the beginning upon the next entrance to an UP state. The total task time \(X=X_\mathrm {r}(\ell )\) (including restarts and pauses in failed states) is investigated with particular emphasis on the expected value \({\pmb {\mathbb {E}}}[X_\mathrm {r}(\ell )]\), for which an explicit formula is derived that applies for all relevant systems. In general, transitions between UP and DOWN states are interdependent, but simplifications are pointed out when the UP to DOWN rate matrix (or the DOWN to UP) has rank one. A number of examples are studied in detail and an asymptotic exponential form \(\exp (\beta _\mathrm {m} \ell )\) is found for the expected total task time \({\pmb {\mathbb {E}}}[X(\ell )]\) as \(\ell \rightarrow \infty \). Also, the asymptotic behavior of the total distribution, \(H_\mathrm {r}(x|\ell )\rightarrow \exp (-x\gamma (\ell ))\), as \(x\rightarrow \infty \) is discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
For clarity, note that in general the boldfaced subscripts identify the entire matrix, not its components. The components are identified by the notation, \((\mathbf{Q_{ud}})_{\mathrm {ij}}\), \(i\in E_\mathrm {u}\), and \(j\in E_\mathrm {d}\).
- 2.
Of course, even if all the Cov coefficients were 0, that would not be sufficient to have independence. Just one nonzero coefficient proves dependence.
- 3.
Strictly speaking, this is only true if \(\sigma \) is unique. If it is a multiple root then the formula is more complicated.
- 4.
spr[\(\mathbf{X}\)] = SPectral Radius := Largest eigenvalue in magnitude of matrix \(\mathbf{X}\).
- 5.
\(\mathbf{D(\ell \beta _\mathrm {m})}\) is a matrix whose components are polynomials of degree less than or equal to the degree of \(p(\ell \beta _\mathrm {m})\). Therefore, \(\mathbf{D}(\ell \beta _\mathrm {m})/p(\ell \beta _\mathrm {m})\sim {O(1)}\).
References
Asmussen S, Fiorini PM, Lipsky L, Rolski T, Sheahan R (2008) Asymptotic behavior of total times for jobs that must start over if a failure occurs. Math Oper Res 33(4):932–944
Asmussen S, Lipsky L, Thompson S (2014) Checkpointing in failure recovery in computing and data transmission. In: Sericola B, Telek M, Horváth G (eds) Analytical and stochastic modelling techniques and applications—21st international conference, ASMTA 2014, Budapest, Hungary, 30 June–2 July 2014. Proceedings, vol 8499 of Lecture Notes in Computer Science, pp 253–272. Springer
Bobbio A, Trivedi K (1990) Computation of the distribution of the completion time when the work requirement is a ph random variable. Stoch Models 6:133–150
Castillo X, Siewiorek DP (1980) A performance-reliability model for computing systems. In: Proceedings of the FTCS-10, Silver Spring, MD, IEEE Computer Society, pp 187–192
Chimento Jr PF, Trivedi KS (1993) The completion time of programs on processors subject to failure and repair. IEEE Trans Comput 42(1)
Chlebus BS, De Prisco R, Shvartsman AA (2001) Performing tasks on synchronous restartable message-passing processors. Distrib Comput 14:49–64
De Prisco R, Mayer A, Yung M (1994) Time-optimal message-efficient work performance in the presence of faults. In: Proceedings of the 13th ACM PODC, pp 161–172
Fiorini PM (1998) Modeling telecommunication systems with self-similar data traffic. Department of Computer Science, University of Connecticut, phd thesis edition
Gross D, Harris C (1998) Fundamentals of queueing theory, 3rd edn. Wiley
Jagerman D (1982) An inversion technique for the laplace transform. Bell Syst Tech J 61:1995–2002
Kulkarni V, Nicola V, Trivedi K (1986) On modeling the performance and reliability of multimode systems. J Syst Softw 6:175–183
Kulkarni V, Nicola V, Trivedi K (1987) The completion time of a job on a multimode system. Adv Appl Probab 19:932–954
Lipsky L (2008) Queueing theory. A linear algebraic approach, 2nd edn. Springer
Rubino G, Sericola B (2014) Markov chains and dependability theory, 1st edn. Cambridge University Press
Sheahan R, Lipsky L, Fiorini P, Asmussen S (2006) On the distribution of task completion times for tasks that must restart from the beginning if failure occurs. SIGMETRICS Perform Eval Rev 34:24–26
Søren A, Lester L, Asmussen S, Lipsky L, Thompson S (2015) Markov renewal methods in restart problems in complex systems. Thiele Research Reports, Department of Mathematical Sciences, University of Aarhus, vol 156
Trivedi K (2001) Probability and statistics with reliability, queueing, and computer science applications, 2nd edn. Wiley-Interscience
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Thompson, S., Lipsky, L., Asmussen, S. (2016). Linear Algebraic Methods in RESTART Problems in Markovian Systems. In: Fiondella, L., Puliafito, A. (eds) Principles of Performance and Reliability Modeling and Evaluation. Springer Series in Reliability Engineering. Springer, Cham. https://doi.org/10.1007/978-3-319-30599-8_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-30599-8_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-30597-4
Online ISBN: 978-3-319-30599-8
eBook Packages: EngineeringEngineering (R0)