Algorithms for the Solution of a Large-Scale Single-Controller Stochastic Game

Breton, Michèle; Hachem, Saeb El

doi:10.1007/978-1-4612-0245-5_11

Michèle Breton¹³ &
Saeb El Hachem¹⁴

Part of the book series: Annals of the International Society of Dynamic Games ((AISDG,volume 1))

315 Accesses

Abstract

In this paper, we present three algorithms for the solution of a large-scale zero-sum two-player stochastic game in discrete time, with a finite state set and continuous action spaces, where one of the players controls the transition probabilities. This particular game has an application in multi-stage stochastic decision problems.

Résumé

Dans cet article, nous proposons trois algorithmes pour la solution d’un jeu stochastique à somme nulle en temps discret, comportant un ensemble fini d’états et un espace d’actions continu. II s’agit d’un jeu dit à contrôleur unique, où l’un des joueurs décide des probabilités de transition entre états. On présente une application du jeu à des problèmes en environnement dynamique stochastique avec information incomplète sur les données.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Breton, M. and S. El-Hachem (1991), Decomposition Algorithms for Stochastic Dynamic Programs, working paper, Cahiers du GERAD G-91-15.
Google Scholar
Breton, M. and S. El-Hachem (1992), Algorithms for the Solution of Stochastic Dynamic Minimax Problems, working paper, Cahiers du GERAD G-92-32.
Google Scholar
Dupacova, J. (1987), The Minimax Approach to Stochastic Programming and Illustrative Application, Stochastics, 28, 73–88.
Article MathSciNet Google Scholar
Eckstein, J. (1989), Splitting Methods for Monotone Operators with Applications to Parallel Optimization, PhD thesis, M.I.T.
Google Scholar
Ermoliev, Y. (1966), Methods for Solving Non-Linear Extremal Problems, Cybernetics, 2, 1–17.
Article Google Scholar
Ermoliev, Y., A. Gaivoronsky and C. Nedeva (1985), Stochastic Optimization Problems with Partially Known Distribution Functions, SIAM Journal on Control and Optimization, 23, 697–716.
Article MathSciNet MATH Google Scholar
Filar, J.A. and T.E.S. Raghavan (1984), A Matrix Game Solution of the Single-Controller Stochastic Game, MOR, 9, 356–362.
Article MathSciNet MATH Google Scholar
Gabay, D. and B. Mercier (1976), A Dual Algorithm for the Solution of Nonlinear Variational Problems via Finite Element Approximations, Computers and Mathematics with Applications, 2, 17–40.
Article MATH Google Scholar
Glowinski, R. and A. Marroco (1975), Sur l‘approximation, par elements finis d’ordre un, et la résolution, par pénalisation-dualité, d’une classe de problèmes de Dirichlet non linéaires, Revue Française d’Automatique, Informatique et Recherche Opérationnelle, 9, 41–76.
MATH Google Scholar
Hiriart-Urruty, J.-B. (1980), Lipschitz r-continuity of the Approximate Sub-differential of a Convex Function, Mathematica Scandinavica, 47, 123–134.
MathSciNet MATH Google Scholar
Kallenberg, L.C.M. (1983), Linear Programming and Finite Markovian Control Problems, Mathematical Center, Amsterdam, Holland (Chapter 6).
MATH Google Scholar
Kim, S., H. Ahn and S.-C. Cho (1991). Variable Target Value Subgradient Method, Mathematical Programming, 49, 359–369.
Article MathSciNet MATH Google Scholar
Kiwiel, K.C. (1985), Methods of Descent for Nondifferentiable Optimization, Lecture Notes in Mathematics, 1133, Springer-Verlag, Berlin.
Google Scholar
Lemaréchal, C., J.-J. Strodiot and A. Bihain (1981), On a Bundle Algorithm for Non Smooth Optimization, in Nonlinear Programming 4, O.L. Mangasarian, R.R. Meyer and S.M. Robinson eds., Academic Press, New York, 245–282.
Google Scholar
Minoux, M. (1983), Programmation mathématique, théorie et algorithmes, Dunod, Paris.
MATH Google Scholar
Parthasarathy, T. and T.E.S. Raghavan (1981), An Orderfield Property for Stochastic Games when One Player Controls Transition Probabilities, Journal of Optimization Theory and Applications, 33, 375–392.
Article MathSciNet MATH Google Scholar
Polyak, B.T. (1966), A General Method for Solving Extremum Problems, Soviet Mathematics, 8, 593–597.
Google Scholar
Rockafellar, R.T. (1970), Convex Analysis, Princeton University Press, Princeton.
MATH Google Scholar
Shapley, L.S. (1953), Stochastic Games, Proceedings of the National Academy of Sciences of USA, 39, 1095–1100.
Article MathSciNet MATH Google Scholar
Vrieze, O.J. (1981), Linear Programming and Undiscounted Games in Which One Player Controls Transitions, OR Spectrum, 3, 29–35.
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

École des Hautes Études Commerciales, Affiliée à l’Université de Montréal, Montréal, Quebec, Canada, H3T 1V6
Michèle Breton
GERAD, 5255 Av. Decelles, Montreal, Quebec, Canada, H3T 1V6
Saeb El Hachem

Authors

Michèle Breton
View author publications
You can also search for this author in PubMed Google Scholar
Saeb El Hachem
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Coordinated Science Laboratory, University of Illinois, Urbana, IL, 61801, USA
Tamer Başar
Department of Management Studies, University of Geneva, CH-1211, Geneva, Switzerland
Alain Haurie

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Breton, M., Hachem, S.E. (1994). Algorithms for the Solution of a Large-Scale Single-Controller Stochastic Game. In: Başar, T., Haurie, A. (eds) Advances in Dynamic Games and Applications. Annals of the International Society of Dynamic Games, vol 1. Birkhäuser, Boston, MA. https://doi.org/10.1007/978-1-4612-0245-5_11

Download citation

DOI: https://doi.org/10.1007/978-1-4612-0245-5_11
Publisher Name: Birkhäuser, Boston, MA
Print ISBN: 978-1-4612-6679-2
Online ISBN: 978-1-4612-0245-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics