Abstract
We prove in a dynamic programming framework that uniform convergence of the finite horizon values implies that asymptotically the average accumulated payoff is constant on optimal trajectories. We analyze and discuss several possible extensions to two-person games.
Similar content being viewed by others
References
Lehrer E. and Sorin, S. (1992). A Uniform tauberian theorem in dynamic programming. Math. Oper. Res., 17, 303–307.
Monderer D. and Sorin, S. (1993). Asymptotic properties in dynamic programming. Internat. J. Game Theory, 22, 1–11.
Mertens J.-F., Sorin, S. and Zamir, S. (1994). Repeated games. CORE Discussion Papers 9420, 9421, 9422.
Oliu-barton, M. and Vigeral, G. (2009). A uniform Tauberian theorem in optimal control. Preprint.
Quincampoix, M. and Renault, J. (2009). On the existence of a limit value in some non expansive optimal control problems. Preprint.
Renault, J. (2006). The value of Markov chain repeated games with lack of information on one side. Math. Oper. Res., 31, 490–512.
Renault, J. (2007). Uniform value in dynamic programming. Cahier du CEREMADE, 2007-1.
Sorin, S. (2002). A first course on zero-sum repeated games. Mathématiques et Applications, 37, Springer.
Sorin, S. (2005). New approaches and recent advances in two-person zero-sum repeated games. In Advances in Dynamic Games, (A. Nowak and K. Szajowski, eds.). Birkhauser, 67–93.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sorin, S., Venel, X. & Vigeral, G. Asymptotic properties of optimal trajectories in dynamic programming. Sankhya 72, 237–245 (2010). https://doi.org/10.1007/s13171-010-0011-8
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13171-010-0011-8