Relative Value Iteration for Stochastic Differential Games

Arapostathis, Ari; Borkar, Vivek S.; Kumar, K. Suresh

doi:10.1007/978-3-319-02690-9_1

Ari Arapostathis⁴,
Vivek S. Borkar⁵ &
K. Suresh Kumar⁶

Part of the book series: Annals of the International Society of Dynamic Games ((AISDG,volume 13))

1107 Accesses
4 Citations

Abstract

We study zero-sum stochastic differential games with player dynamics governed by a nondegenerate controlled diffusion process. Under the assumption of uniform stability, we establish the existence of a solution to the Isaac’s equation for the ergodic game and characterize the optimal stationary strategies. The data is not assumed to be bounded, nor do we assume geometric ergodicity. Thus our results extend previous work in the literature. We also study a relative value iteration scheme that takes the form of a parabolic Isaac’s equation. Under the hypothesis of geometric ergodicity we show that the relative value iteration converges to the elliptic Isaac’s equation as time goes to infinity. We use these results to establish convergence of the relative value iteration for risk-sensitive control problems under an asymptotic flatness assumption.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Arapostathis A, Borkar VS (2012) A relative value iteration algorithm for nondegenerate controlled diffusions. SIAM J Control Optim 50(4):1886–1902
Article MathSciNet MATH Google Scholar
Arapostathis A, Borkar VS, Ghosh MK (2011) Ergodic control of diffusion processes. Encyclopedia of mathematics and its applications, vol 143. Cambridge University Press, Cambridge
Book Google Scholar
Basak GK, Bhattacharya RN (1992) Stability in distribution for a class of singular diffusions. Ann Probab 20(1):312–321
Article MathSciNet MATH Google Scholar
Beneš VE (1970) Existence of optimal strategies based on specified information, for a class of stochastic decision problems. SIAM J Control 8:179–188
Article MathSciNet MATH Google Scholar
Borkar VS, Ghosh MK (1992) Stochastic differential games: occupation measure based approach. J Optim Theory Appl 73(2):359–385
Article MathSciNet MATH Google Scholar
Borkar VS, Suresh Kumar K (2010) Singular perturbations in risk-sensitive stochastic control. SIAM J Control Optim 48(6):3675–3697
Article MathSciNet MATH Google Scholar
Fleming WH, McEneaney WM (1995) Risk-sensitive control on an infinite time horizon. SIAM J Control Optim 33(6):1881–1915
Article MathSciNet MATH Google Scholar
Gilbarg D, Trudinger NS (1983) Elliptic partial differential equations of second order, 2nd edn. Grundlehren der Mathematischen Wissenschaften, vol 224. Springer, Berlin
Google Scholar
Gruber M (1984) Harnack inequalities for solutions of general second order parabolic equations and estimates of their Hölder constants. Math Z 185(1):23–43
Article MathSciNet MATH Google Scholar
Ladyženskaja OA, Solonnikov VA, Ural′ceva NN (1967) Linear and quasilinear equations of parabolic type. Translated from the Russian by S. Smith. Translations of Mathematical Monographs, Vol. 23. American Mathematical Society, Providence, R.I.
Google Scholar
Meyn SP, Tweedie RL (1993) Stability of Markovian processes. III. Foster-Lyapunov criteria for continuous-time processes. Adv Appl Probab 25(3):518–548
MathSciNet MATH Google Scholar
White DJ (1963) Dynamic programming, Markov chains, and the method of successive approximations. J Math Anal Appl 6:373–376
Article MathSciNet MATH Google Scholar
Whittle P (1990) Risk-sensitive optimal control. Wiley-Interscience Series in Systems and Optimization. Wiley, Chichester
Google Scholar

Download references

Acknowledgments

The work of Ari Arapostathis was supported in part by the Office of Naval Research under the Electric Ship Research and Development Consortium. The work of Vivek Borkar was supported in part by Grant #11IRCCSG014 from IRCC, IIT, Mumbai.

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, The University of Texas at Austin, 1 University Station, Austin, TX, 78712, USA
Ari Arapostathis
Department of Electrical Engineering, Indian Institute of Technology, Powai, Mumbai, 400076, India
Vivek S. Borkar
Department of Mathematics, Indian Institute of Technology, Powai, Mumbai, 400076, India
K. Suresh Kumar

Authors

Ari Arapostathis
View author publications
You can also search for this author in PubMed Google Scholar
Vivek S. Borkar
View author publications
You can also search for this author in PubMed Google Scholar
K. Suresh Kumar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vivek S. Borkar .

Editor information

Editors and Affiliations

Institute of Entomology Dept. of Theoretical Ecology, Biology Centre AS CR, České Budějovice, Czech Republic
Vlastimil Křivan
GERAD, HEC Montréal, Montreal, Québec, Canada
Georges Zaccour

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Arapostathis, A., Borkar, V.S., Kumar, K.S. (2013). Relative Value Iteration for Stochastic Differential Games. In: Křivan, V., Zaccour, G. (eds) Advances in Dynamic Games. Annals of the International Society of Dynamic Games, vol 13. Birkhäuser, Cham. https://doi.org/10.1007/978-3-319-02690-9_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-02690-9_1
Published: 18 November 2013
Publisher Name: Birkhäuser, Cham
Print ISBN: 978-3-319-02689-3
Online ISBN: 978-3-319-02690-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics