Abstract
In this chapter the state and action spaces are assumed to be separable metric spaces. We apply the measurable selection theorem of Kuratowski/Ryll-Nardzewski in order to prove the existence of maximizers. We present different sets of assumptions under which optimal policies for MDPs exist.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dudley, R. M. (1989). Real analysis and probability. Pacific Grove: Wadsworth & Brooks/Cole Advanced Books & Software.
Hernández-Lerma, O., & Lasserre, J. B. (1999). Further topics on discrete-time Markov control processes. New York: Springer.
Kuratowski, K., & Ryll-Nardzewski, C. (1965). A general theorem on selectors. Bulletin de L’Académie Polonaise Des Sciences: Série des Sciences Mathématiques, Astronomiques et Physiques, 13, 397–403.
Rieder, U. (1978). Measurable selection theorems for optimization problems. Manuscripta Mathematica, 24, 115–131.
Schäl, M. (1975). Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete, 32, 179–196.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this chapter
Cite this chapter
Hinderer, K., Rieder, U., Stieglitz, M. (2016). Existence of Optimal Policies. In: Dynamic Optimization. Universitext. Springer, Cham. https://doi.org/10.1007/978-3-319-48814-1_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-48814-1_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48813-4
Online ISBN: 978-3-319-48814-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)