Abstract
In this paper, we present three algorithms for the solution of a large-scale zero-sum two-player stochastic game in discrete time, with a finite state set and continuous action spaces, where one of the players controls the transition probabilities. This particular game has an application in multi-stage stochastic decision problems.
Résumé
Dans cet article, nous proposons trois algorithmes pour la solution d’un jeu stochastique à somme nulle en temps discret, comportant un ensemble fini d’états et un espace d’actions continu. II s’agit d’un jeu dit à contrôleur unique, où l’un des joueurs décide des probabilités de transition entre états. On présente une application du jeu à des problèmes en environnement dynamique stochastique avec information incomplète sur les données.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Breton, M. and S. El-Hachem (1991), Decomposition Algorithms for Stochastic Dynamic Programs, working paper, Cahiers du GERAD G-91-15.
Breton, M. and S. El-Hachem (1992), Algorithms for the Solution of Stochastic Dynamic Minimax Problems, working paper, Cahiers du GERAD G-92-32.
Dupacova, J. (1987), The Minimax Approach to Stochastic Programming and Illustrative Application, Stochastics, 28, 73–88.
Eckstein, J. (1989), Splitting Methods for Monotone Operators with Applications to Parallel Optimization, PhD thesis, M.I.T.
Ermoliev, Y. (1966), Methods for Solving Non-Linear Extremal Problems, Cybernetics, 2, 1–17.
Ermoliev, Y., A. Gaivoronsky and C. Nedeva (1985), Stochastic Optimization Problems with Partially Known Distribution Functions, SIAM Journal on Control and Optimization, 23, 697–716.
Filar, J.A. and T.E.S. Raghavan (1984), A Matrix Game Solution of the Single-Controller Stochastic Game, MOR, 9, 356–362.
Gabay, D. and B. Mercier (1976), A Dual Algorithm for the Solution of Nonlinear Variational Problems via Finite Element Approximations, Computers and Mathematics with Applications, 2, 17–40.
Glowinski, R. and A. Marroco (1975), Sur l‘approximation, par elements finis d’ordre un, et la résolution, par pénalisation-dualité, d’une classe de problèmes de Dirichlet non linéaires, Revue Française d’Automatique, Informatique et Recherche Opérationnelle, 9, 41–76.
Hiriart-Urruty, J.-B. (1980), Lipschitz r-continuity of the Approximate Sub-differential of a Convex Function, Mathematica Scandinavica, 47, 123–134.
Kallenberg, L.C.M. (1983), Linear Programming and Finite Markovian Control Problems, Mathematical Center, Amsterdam, Holland (Chapter 6).
Kim, S., H. Ahn and S.-C. Cho (1991). Variable Target Value Subgradient Method, Mathematical Programming, 49, 359–369.
Kiwiel, K.C. (1985), Methods of Descent for Nondifferentiable Optimization, Lecture Notes in Mathematics, 1133, Springer-Verlag, Berlin.
Lemaréchal, C., J.-J. Strodiot and A. Bihain (1981), On a Bundle Algorithm for Non Smooth Optimization, in Nonlinear Programming 4, O.L. Mangasarian, R.R. Meyer and S.M. Robinson eds., Academic Press, New York, 245–282.
Minoux, M. (1983), Programmation mathématique, théorie et algorithmes, Dunod, Paris.
Parthasarathy, T. and T.E.S. Raghavan (1981), An Orderfield Property for Stochastic Games when One Player Controls Transition Probabilities, Journal of Optimization Theory and Applications, 33, 375–392.
Polyak, B.T. (1966), A General Method for Solving Extremum Problems, Soviet Mathematics, 8, 593–597.
Rockafellar, R.T. (1970), Convex Analysis, Princeton University Press, Princeton.
Shapley, L.S. (1953), Stochastic Games, Proceedings of the National Academy of Sciences of USA, 39, 1095–1100.
Vrieze, O.J. (1981), Linear Programming and Undiscounted Games in Which One Player Controls Transitions, OR Spectrum, 3, 29–35.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1994 Springer Science+Business Media New York
About this paper
Cite this paper
Breton, M., Hachem, S.E. (1994). Algorithms for the Solution of a Large-Scale Single-Controller Stochastic Game. In: Başar, T., Haurie, A. (eds) Advances in Dynamic Games and Applications. Annals of the International Society of Dynamic Games, vol 1. Birkhäuser, Boston, MA. https://doi.org/10.1007/978-1-4612-0245-5_11
Download citation
DOI: https://doi.org/10.1007/978-1-4612-0245-5_11
Publisher Name: Birkhäuser, Boston, MA
Print ISBN: 978-1-4612-6679-2
Online ISBN: 978-1-4612-0245-5
eBook Packages: Springer Book Archive