Reachability in Recursive Markov Decision Processes

Brázdil, Tomáš; Brožek, Václav; Forejt, Vojtěch; Kučera, Antonín

doi:10.1007/11817949_24

Tomáš Brázdil¹⁸,
Václav Brožek¹⁸,
Vojtěch Forejt¹⁸ &
…
Antonín Kučera¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4137))

Included in the following conference series:

International Conference on Concurrency Theory

586 Accesses
7 Citations

Abstract

We consider a class of infinite-state Markov decision processes generated by stateless pushdown automata. This class corresponds to \(1 \frac{1}{2}\)-player games over graphs generated by BPA systems or (equivalently) 1-exit recursive state machines. An extended reachability objective is specified by two sets S and T of safe and terminal stack configurations, where the membership to S and T depends just on the top-of-the-stack symbol. The question is whether there is a suitable strategy such that the probability of hitting a terminal configuration by a path leading only through safe configurations is equal to (or different from) a given x ∈{0,1}. We show that the qualitative extended reachability problem is decidable in polynomial time, and that the set of all configurations for which there is a winning strategy is effectively regular. More precisely, this set can be represented by a deterministic finite-state automaton with a fixed number of control states. This result is a generalization of a recent theorem by Etessami & Yannakakis which says that the qualitative termination for 1-exit RMDPs (which exactly correspond to our \(1 \frac{1}{2}\)-player BPA games) is decidable in polynomial time. Interestingly, the properties of winning strategies for the extended reachability objectives are quite different from the ones for termination, and new observations are needed to obtain the result. As an application, we derive the EXPTIME-completeness of the model-checking problem for \(1 \frac{1}{2}\)-player BPA games and qualitative PCTL formulae.

Supported by the research center Institute for Theoretical Computer Science (ITI), project No. 1M0545.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baier, C., Kwiatkowska, M.: Model checking for a probabilistic branching time logic with fairness. Distributed Computing 11(3), 125–155 (1998)
Article Google Scholar
Bianco, A., de Alfaro, L.: Model checking of probabalistic and nondeterministic systems. In: Thiagarajan, P.S. (ed.) FSTTCS 1995. LNCS, vol. 1026, pp. 499–513. Springer, Heidelberg (1995)
Google Scholar
Brázdil, T., Kučera, A., Stražovský, O.: On the decidability of temporal properties of probabilistic pushdown automata. In: Diekert, V., Durand, B. (eds.) STACS 2005. LNCS, vol. 3404, pp. 145–157. Springer, Heidelberg (2005)
Chapter Google Scholar
Esparza, J., Kučera, A., Mayr, R.: Model-checking probabilistic pushdown automata. In: Proceedings of LICS 2004, pp. 12–21. IEEE, Los Alamitos (2004)
Google Scholar
Esparza, J., Kučera, A., Schwoon, S.: Model-checking LTL with regular valuations for pushdown systems. I&C 186(2), 355–376 (2003)
MATH Google Scholar
Etessami, K., Yannakakis, M.: Recursive Markov Decision Processes and Recursive Stochastic Games. In: Caires, L., Italiano, G.F., Monteiro, L., Palamidessi, C., Yung, M. (eds.) ICALP 2005. LNCS, vol. 3580, pp. 891–903. Springer, Heidelberg (2005)
Chapter Google Scholar
Etessami, K., Yannakakis, M.: Efficient qualitative analysis of classes of recursive Markov decision processes and simple stochastic games. In: Durand, B., Thomas, W. (eds.) STACS 2006. LNCS, vol. 3884, pp. 634–645. Springer, Heidelberg (2006)
Chapter Google Scholar
Feinberg, E., Shwartz, A. (eds.): Handbook of Markov Decision Processes. Kluwer, Dordrecht (2002)
MATH Google Scholar
Hansson, H., Jonsson, B.: A logic for reasoning about time and reliability. Formal Aspects of Computing 6, 512–535 (1994)
Article MATH Google Scholar
Hinton, A., Kwiatkowska, M., Norman, G., Parker, D.: PRISM: a tool for automatic verification of probabilistic systems. In: Hermanns, H., Palsberg, J. (eds.) TACAS 2006 and ETAPS 2006. LNCS, vol. 3920. Springer, Heidelberg (2006)
Chapter Google Scholar
Puterman, M.L.: Markov Decision Processes. Wiley, Chichester (1994)
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Informatics, Masaryk University, Botanická 68a, 60200, Brno, Czech Republic
Tomáš Brázdil, Václav Brožek, Vojtěch Forejt & Antonín Kučera

Authors

Tomáš Brázdil
View author publications
You can also search for this author in PubMed Google Scholar
Václav Brožek
View author publications
You can also search for this author in PubMed Google Scholar
Vojtěch Forejt
View author publications
You can also search for this author in PubMed Google Scholar
Antonín Kučera
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute for Theoretical Computer Science, Technical University Dresden, Germany
Christel Baier
INRIA, VASY, Grenoble Rhône-Alpes, France
Holger Hermanns

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Brázdil, T., Brožek, V., Forejt, V., Kučera, A. (2006). Reachability in Recursive Markov Decision Processes. In: Baier, C., Hermanns, H. (eds) CONCUR 2006 – Concurrency Theory. CONCUR 2006. Lecture Notes in Computer Science, vol 4137. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11817949_24

Download citation

DOI: https://doi.org/10.1007/11817949_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37376-6
Online ISBN: 978-3-540-37377-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics