Stochastic Differential Games and Intricacy of Information Structures

Başar, Tamer

doi:10.1007/978-3-642-54248-0_2

Tamer Başar⁶

Part of the book series: Dynamic Modeling and Econometrics in Economics and Finance ((DMEF,volume 16))

1900 Accesses
4 Citations

Abstract

This chapter discusses, in both continuous time and discrete time, the issue of certainty equivalence in two-player zero-sum stochastic differential/dynamic games when the players have access to state information through a common noisy measurement channel. For the discrete-time case, the channel is also allowed to fail sporadically according to an independent Bernoulli process, leading to intermittent loss of measurements, where the players are allowed to observe past realizations of this process. A complete analysis of a parametrized two-stage stochastic dynamic game is conducted in terms of existence, uniqueness and characterization of saddle-point equilibria (SPE), which is shown to admit SPE of both certainty-equivalent (CE) and non-CE types, in different regions of the parameter space; for the latter, the SPE involves mixed strategies by the maximizer. The insight provided by the analysis of this game is used to obtain through an indirect approach SPE for three classes of differential/dynamic games: (i) linear-quadratic-Gaussian (LQG) zero-sum differential games with common noisy measurements, (ii) discrete-time LQG zero-sum dynamic games with common noisy measurements, and (iii) discrete-time LQG zero-sum dynamic games with intermittently missing perfect state measurements. In all cases CE is a generalized notion, requiring two separate filters for the players, even though they have a common communication channel. Discussions on extensions to other classes of stochastic games, including nonzero-sum stochastic games, and on the challenges that lie ahead conclude the chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
They are taken to be constant for simplicity in exposition; the main message of the paper and many of the expressions stand for the time-varying case as well, with some obvious modifications.
2.
We are using “T” to denote the number of stages in the game; the same notation was used to denote transpose. These are such distinct usages that no confusion or ambiguity should arise.
3.
We have taken α ₁ not to be dependent on β ₁, because when β ₁=0, y ₁≡0, making α ₁ superfluous.
4.
One can take any form here, since \(\hat{\gamma}\) had annihilated v, but we take linear-plus-Gaussian in anticipation of \(\hat{\gamma}\) to be in equilibrium with v.

References

Altman, E., Kambley, V., & Silva, A. (2009). Stochastic games with one step delay sharing information pattern with application to power control. In Proc. internat. conf. game theory for networks (GameNets’09), Turkey, Istanbul (pp. 124–129).
Chapter Google Scholar
Bansal, R., & Başar, T. (1987). Stochastic teams with nonclassical information revisited: when is an affine law optimal? IEEE Transactions on Automatic Control, 32(6), 554–559.
Article Google Scholar
Başar, T. (1978a). Two-criteria LQG decision problems with one-step delay observation sharing pattern. Information and Control, 38(1), 21–50.
Article Google Scholar
Başar, T. (1978b). Decentralized multicriteria optimization of linear stochastic systems. IEEE Transactions on Automatic Control, 23(2), 233–243.
Article Google Scholar
Başar, T. (1981). On the saddle-point solution of a class of stochastic differential games. Journal of Optimization Theory and Applications, 33(4), 539–556.
Article Google Scholar
Başar, T. (2008). Variations on the theme of the Witsenhausen counterexample. In Proc. 47th IEEE conf. decision and control (pp. 1614–1619).
Google Scholar
Başar, T., & Bernhard, P. (1995). Systems & control: foundations and applications series. H ^∞ optimal control and related minimax design problems: a dynamic game approach. Boston: Birkhäuser.
Google Scholar
Başar, T., & Mintz, M. (1972). On the existence of linear saddle-point strategies for a two-person zero-sum stochastic game. In Proc. 1972 IEEE conf. decision and control (pp. 188–192).
Google Scholar
Başar, T., & Mintz, M. (1973). A multistage pursuit-evasion game that admits a Gaussian random process as a maximin control policy. Stochastics, 1(1–4), 25–69.
Google Scholar
Başar, T., & Olsder, G. J. (1999). SIAM series in classics in applied mathematics. Dynamic non-cooperative game theory. Philadelphia: Society for Industrial and Applied Mathematics.
Google Scholar
Behn, R., & Ho, Y.-C. (1968). On a class of linear stochastic differential games. IEEE Transactions on Automatic Control, 13(3), 227–240.
Article Google Scholar
Fleming, W. H., & Soner, H. M. (1993). Controlled Markov processes and viscosity solutions. Berlin: Springer.
Google Scholar
Hespanha, J., & Prandini, M. (2001). Nash equilibria in partial-information games on Markov chains. In Proc. 40th IEEE conf. decision and control (pp. 2102–2107).
Google Scholar
Ho, Y.-C. (1974). On the minimax principle and zero-sum stochastic differential games. Journal of Optimization Theory and Applications, 13(3), 343–361.
Article Google Scholar
Ho, Y.-C. (1980). Team decision theory and information structures. Proceedings of the IEEE, 68(6), 644–654.
Article Google Scholar
Nayyar, A., & Başar, T. (2012). Dynamic stochastic games with asymmetric information. In Proc. 51st IEEE conf. decision and control, Maui, Hawaii (pp. 7145–7150).
Google Scholar
Nayyar, A., Mahajan, A., & Teneketzis, D. (2011). Optimal control strategies in delayed sharing information structures. IEEE Transactions on Automatic Control, 57(7), 1606–1620.
Article Google Scholar
Pan, Z., & Başar, T. (1995). H ^∞ control of Markovian jump systems and solutions to associated piecewise-deterministic differential games. In G. J. Olsder (Ed.), Annals of dynamic games (Vol. 2, pp. 61–94). Boston: Birkhäuser.
Google Scholar
Rhodes, I., & Luenberger, D. (1969). Differential games with imperfect state information. IEEE Transactions on Automatic Control, 14(1), 29–38.
Article Google Scholar
Shi, L., Epstein, M., & Murray, R. M. (2010). Kalman filtering over a packet-dropping network: a probabilistic perspective. IEEE Transactions on Automatic Control, 55(1), 594–604.
Google Scholar
Willman, W. (1969). Formal solutions for a class of stochastic pursuit-evasion games. IEEE Transactions on Automatic Control, 14(5), 504–509.
Article Google Scholar
Witsenhausen, H. (1968). A counterexample in stochastic optimum control. SIAM Journal on Control, 6, 131–147.
Article Google Scholar
Witsenhausen, H. (1971a). Separation of estimation and control for discrete time systems. Proceedings of the IEEE, 59(11), 1557–1566.
Article Google Scholar
Witsenhausen, H. (1971b). On information structures, feedback and causality. SIAM Journal on Control, 9(2), 149–160.
Article Google Scholar
Yüksel, S., & Başar, T. (2013). Systems & control: foundations and applications series. Stochastic networked control systems: stabilization and optimization under information constraints. Boston: Birkhäuser.
Book Google Scholar

Download references

Acknowledgements

This work was supported in part by the AFOSR MURI Grant FA9550-10-1-0573, and in part by NSF under grant number CCF 11-11342.

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Coordinated Science Laboratory, University of Illinois, Urbana, IL, USA
Tamer Başar

Authors

Tamer Başar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tamer Başar .

Editor information

Editors and Affiliations

Institute of Mathematical Methods in Economics, Vienna University of Technology, Vienna, Austria
Josef Haunschmied
Institute of Mathematical Methods in Economics, Vienna University of Technology, Vienna, Austria
Vladimir M. Veliov
Department of Business Administraton, University of Vienna, Vienna, Austria
Stefan Wrzaczek

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Başar, T. (2014). Stochastic Differential Games and Intricacy of Information Structures. In: Haunschmied, J., Veliov, V., Wrzaczek, S. (eds) Dynamic Games in Economics. Dynamic Modeling and Econometrics in Economics and Finance, vol 16. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54248-0_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-54248-0_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54247-3
Online ISBN: 978-3-642-54248-0
eBook Packages: Business and EconomicsEconomics and Finance (R0)

Publish with us

Policies and ethics