Delayed Nondeterminism in Continuous-Time Markov Decision Processes

Neuhäußer, Martin R.; Stoelinga, Mariëlle; Katoen, Joost-Pieter

doi:10.1007/978-3-642-00596-1_26

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5504)

Included in the following conference series:

International Conference on Foundations of Software Science and Computational Structures

Abstract

Schedulers in randomly timed games can be classified according to whether or not they use timing information. We consider continuous-time Markov decision processes (CTMDPs) and define a hierarchy of positional (P) and history-dependent (H) schedulers which induce strictly tighter bounds on quantitative properties of CTMDPs. This classification into time-abstract (TA), total-time (TT) and fully time-dependent (T) schedulers is mainly based on the kind of timing details that the schedulers may exploit. We investigate when the resolution of nondeterminism may be deferred. In particular, we show that TTP and TAP schedulers allow for delaying nondeterminism for all measures, whereas this holds neither for TP nor for any TAH scheduler. The core of our study is a transformation on CTMDPs which unifies the speed of outgoing transitions per state.
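To make the idea of unifying the speed of outgoing transitions per state more concrete, the following Python sketch pads every action of a state with a self-loop so that all actions enabled in that state share the same exit rate. This is only an illustration under simplifying assumptions: the dictionary encoding `rates[state][action][successor]`, the function names, and the use of self-loops are hypothetical choices made here, and the chapter's actual transformation is not reproduced and may differ (for example, in how it records which action was chosen).

```python
from typing import Dict

# Hypothetical encoding: rates[state][action][successor] = transition rate.
Rates = Dict[str, Dict[str, Dict[str, float]]]


def exit_rate(successors: Dict[str, float]) -> float:
    """Total rate of leaving a state under one action."""
    return sum(successors.values())


def unify_exit_rates(rates: Rates) -> Rates:
    """Return a copy in which all actions of a state have equal exit rate.

    Assumption (not taken from the chapter): the missing rate of a slower
    action is added as a self-loop on the same state.
    """
    result: Rates = {}
    for state, actions in rates.items():
        result[state] = {}
        if not actions:
            continue  # no enabled action, nothing to unify
        # The fastest action of this state sets the common exit rate.
        target = max(exit_rate(succ) for succ in actions.values())
        for action, succ in actions.items():
            padded = dict(succ)
            slack = target - exit_rate(succ)
            if slack > 0:
                padded[state] = padded.get(state, 0.0) + slack
            result[state][action] = padded
    return result


example: Rates = {
    "s0": {"a": {"s1": 1.0, "s2": 2.0}, "b": {"s1": 5.0}},
    "s1": {"a": {"s0": 4.0}},
    "s2": {"a": {"s2": 1.0}},
}
uniform = unify_exit_rates(example)
```

In `example`, action `a` in state `s0` has exit rate 3.0 and action `b` has exit rate 5.0, so the sketch adds a self-loop of rate 2.0 to `a`. States whose actions already agree on their exit rate are left unchanged.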

Supported by the NWO projects QUPES (612.000.420) and FOCUS/BRICKS (642.000.505) (MOQS) and by the EU grants IST-004527 (ARTIST2) and FP7-ICT-2007-1 (QUASIMODO).

Author information

Authors and Affiliations

MOVES Group, RWTH Aachen University, Germany
Martin R. Neuhäußer & Joost-Pieter Katoen
FMT Group, University of Twente, The Netherlands
Martin R. Neuhäußer, Mariëlle Stoelinga & Joost-Pieter Katoen

Editor information

Editors and Affiliations

School of Engineering, Dept. of Computer Engineering, University of California, 1156 High Street MS: SOE3, Santa Cruz, CA 95064, USA
Luca de Alfaro

Cite this paper

Neuhäußer, M.R., Stoelinga, M., Katoen, J.-P. (2009). Delayed Nondeterminism in Continuous-Time Markov Decision Processes. In: de Alfaro, L. (ed.) Foundations of Software Science and Computational Structures. FoSSaCS 2009. Lecture Notes in Computer Science, vol 5504. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00596-1_26

DOI: https://doi.org/10.1007/978-3-642-00596-1_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00595-4
Online ISBN: 978-3-642-00596-1
eBook Packages: Computer Science, Computer Science (R0)
