Asymptotic properties of optimal trajectories in dynamic programming

Sorin, Sylvain; Venel, Xavier; Vigeral, Guillaume

doi:10.1007/s13171-010-0011-8

Published: 17 June 2010

Sankhya A, Volume 72, pages 237–245 (2010)

Abstract

We prove in a dynamic programming framework that uniform convergence of the finite horizon values implies that asymptotically the average accumulated payoff is constant on optimal trajectories. We analyze and discuss several possible extensions to two-person games.
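
The statement can be illustrated numerically on a toy model. The sketch below is a minimal illustration under stated assumptions, not the construction used in the article: the three-state deterministic model (`F`, `g`), the function names, and the convention that the n-stage value is the best achievable average of the payoffs collected at the n states visited after the initial one are all hypothetical choices made for this example.

```python
# Toy deterministic dynamic program (hypothetical, not taken from the article).
# F[z] lists the states reachable from z; g[z] is the payoff collected on entering z.
F = {"a": ["b", "c"], "b": ["a"], "c": ["c"]}
g = {"a": 0.0, "b": 1.0, "c": 0.4}


def finite_horizon_values(F, g, n):
    """Return V[0..n], where V[k][z] is the best total payoff over k stages from z."""
    V = [{z: 0.0 for z in F}]
    for _ in range(n):
        prev = V[-1]
        # Backward induction: pick the successor with the best payoff-to-go.
        V.append({z: max(g[y] + prev[y] for y in F[z]) for z in F})
    return V


def optimal_play(F, g, V, z0, n):
    """Follow maximizing successors for n stages, starting from z0."""
    play, z = [], z0
    for k in range(n, 0, -1):
        z = max(F[z], key=lambda y: g[y] + V[k - 1][y])
        play.append(z)
    return play


def prefix_average(g, play, m):
    """Average payoff accumulated over the first m stages of a play."""
    return sum(g[z] for z in play[:m]) / m


n = 100
V = finite_horizon_values(F, g, n)
v_n = V[n]["a"] / n                      # normalized n-stage value from state "a"
play = optimal_play(F, g, V, "a", n)
for m in (10, 50, 100):
    print(m, prefix_average(g, play, m), v_n)
```

In this toy model the optimal play from `a` alternates between `b` and `a`, so the average payoff over an even-length prefix coincides with the normalized value `v_n`. This only shows the kind of behavior the abstract describes; it does not check the uniform convergence hypothesis or reproduce the proof.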

References

Lehrer, E. and Sorin, S. (1992). A uniform Tauberian theorem in dynamic programming. Math. Oper. Res., 17, 303–307.
Monderer, D. and Sorin, S. (1993). Asymptotic properties in dynamic programming. Internat. J. Game Theory, 22, 1–11.
Mertens, J.-F., Sorin, S. and Zamir, S. (1994). Repeated games. CORE Discussion Papers 9420, 9421, 9422.
Oliu-Barton, M. and Vigeral, G. (2009). A uniform Tauberian theorem in optimal control. Preprint.
Quincampoix, M. and Renault, J. (2009). On the existence of a limit value in some non expansive optimal control problems. Preprint.
Renault, J. (2006). The value of Markov chain repeated games with lack of information on one side. Math. Oper. Res., 31, 490–512.
Renault, J. (2007). Uniform value in dynamic programming. Cahier du CEREMADE, 2007-1.
Sorin, S. (2002). A first course on zero-sum repeated games. Mathématiques et Applications, 37, Springer.
Sorin, S. (2005). New approaches and recent advances in two-person zero-sum repeated games. In Advances in Dynamic Games (A. Nowak and K. Szajowski, eds.). Birkhäuser, 67–93.

Author information

Authors and Affiliations

Sylvain Sorin: Equipe Combinatoire et Optimisation, CNRS FRE 3232, Faculté de Mathématiques, UPMC-Paris 6, 175 Rue du Chevaleret, 75013 Paris, France
Xavier Venel: GREMAQ, Université de Toulouse 1, Aile J.J. Laffont, Manufacture des Tabacs, 21 allée de Brienne, 31000 Toulouse, France
Guillaume Vigeral: INRIA Saclay - Ile-de-France and CMAP, Ecole Polytechnique, route de Saclay, 91128 Palaiseau cedex, France

Corresponding author

Correspondence to Sylvain Sorin.

About this article

Sorin, S., Venel, X. & Vigeral, G. Asymptotic properties of optimal trajectories in dynamic programming. Sankhya 72, 237–245 (2010). https://doi.org/10.1007/s13171-010-0011-8

Received: 15 October 2009
Revised: 15 January 2010
Published: 17 June 2010
Issue Date: February 2010
DOI: https://doi.org/10.1007/s13171-010-0011-8

AMS (2000) subject classification

Primary 49L20, 90C39, 91A15
