Skip to main content

Stochastic Differential Games and Intricacy of Information Structures

  • Chapter
Book cover Dynamic Games in Economics

Part of the book series: Dynamic Modeling and Econometrics in Economics and Finance ((DMEF,volume 16))

Abstract

This chapter discusses, in both continuous time and discrete time, the issue of certainty equivalence in two-player zero-sum stochastic differential/dynamic games when the players have access to state information through a common noisy measurement channel. For the discrete-time case, the channel is also allowed to fail sporadically according to an independent Bernoulli process, leading to intermittent loss of measurements, where the players are allowed to observe past realizations of this process. A complete analysis of a parametrized two-stage stochastic dynamic game is conducted in terms of existence, uniqueness and characterization of saddle-point equilibria (SPE), which is shown to admit SPE of both certainty-equivalent (CE) and non-CE types, in different regions of the parameter space; for the latter, the SPE involves mixed strategies by the maximizer. The insight provided by the analysis of this game is used to obtain through an indirect approach SPE for three classes of differential/dynamic games: (i) linear-quadratic-Gaussian (LQG) zero-sum differential games with common noisy measurements, (ii) discrete-time LQG zero-sum dynamic games with common noisy measurements, and (iii) discrete-time LQG zero-sum dynamic games with intermittently missing perfect state measurements. In all cases CE is a generalized notion, requiring two separate filters for the players, even though they have a common communication channel. Discussions on extensions to other classes of stochastic games, including nonzero-sum stochastic games, and on the challenges that lie ahead conclude the chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    They are taken to be constant for simplicity in exposition; the main message of the paper and many of the expressions stand for the time-varying case as well, with some obvious modifications.

  2. 2.

    We are using “T” to denote the number of stages in the game; the same notation was used to denote transpose. These are such distinct usages that no confusion or ambiguity should arise.

  3. 3.

    We have taken α 1 not to be dependent on β 1, because when β 1=0, y 1≡0, making α 1 superfluous.

  4. 4.

    One can take any form here, since \(\hat{\gamma}\) had annihilated v, but we take linear-plus-Gaussian in anticipation of \(\hat{\gamma}\) to be in equilibrium with v.

References

  • Altman, E., Kambley, V., & Silva, A. (2009). Stochastic games with one step delay sharing information pattern with application to power control. In Proc. internat. conf. game theory for networks (GameNets’09), Turkey, Istanbul (pp. 124–129).

    Chapter  Google Scholar 

  • Bansal, R., & Başar, T. (1987). Stochastic teams with nonclassical information revisited: when is an affine law optimal? IEEE Transactions on Automatic Control, 32(6), 554–559.

    Article  Google Scholar 

  • Başar, T. (1978a). Two-criteria LQG decision problems with one-step delay observation sharing pattern. Information and Control, 38(1), 21–50.

    Article  Google Scholar 

  • Başar, T. (1978b). Decentralized multicriteria optimization of linear stochastic systems. IEEE Transactions on Automatic Control, 23(2), 233–243.

    Article  Google Scholar 

  • Başar, T. (1981). On the saddle-point solution of a class of stochastic differential games. Journal of Optimization Theory and Applications, 33(4), 539–556.

    Article  Google Scholar 

  • Başar, T. (2008). Variations on the theme of the Witsenhausen counterexample. In Proc. 47th IEEE conf. decision and control (pp. 1614–1619).

    Google Scholar 

  • Başar, T., & Bernhard, P. (1995). Systems & control: foundations and applications series. H optimal control and related minimax design problems: a dynamic game approach. Boston: Birkhäuser.

    Google Scholar 

  • Başar, T., & Mintz, M. (1972). On the existence of linear saddle-point strategies for a two-person zero-sum stochastic game. In Proc. 1972 IEEE conf. decision and control (pp. 188–192).

    Google Scholar 

  • Başar, T., & Mintz, M. (1973). A multistage pursuit-evasion game that admits a Gaussian random process as a maximin control policy. Stochastics, 1(1–4), 25–69.

    Google Scholar 

  • Başar, T., & Olsder, G. J. (1999). SIAM series in classics in applied mathematics. Dynamic non-cooperative game theory. Philadelphia: Society for Industrial and Applied Mathematics.

    Google Scholar 

  • Behn, R., & Ho, Y.-C. (1968). On a class of linear stochastic differential games. IEEE Transactions on Automatic Control, 13(3), 227–240.

    Article  Google Scholar 

  • Fleming, W. H., & Soner, H. M. (1993). Controlled Markov processes and viscosity solutions. Berlin: Springer.

    Google Scholar 

  • Hespanha, J., & Prandini, M. (2001). Nash equilibria in partial-information games on Markov chains. In Proc. 40th IEEE conf. decision and control (pp. 2102–2107).

    Google Scholar 

  • Ho, Y.-C. (1974). On the minimax principle and zero-sum stochastic differential games. Journal of Optimization Theory and Applications, 13(3), 343–361.

    Article  Google Scholar 

  • Ho, Y.-C. (1980). Team decision theory and information structures. Proceedings of the IEEE, 68(6), 644–654.

    Article  Google Scholar 

  • Nayyar, A., & Başar, T. (2012). Dynamic stochastic games with asymmetric information. In Proc. 51st IEEE conf. decision and control, Maui, Hawaii (pp. 7145–7150).

    Google Scholar 

  • Nayyar, A., Mahajan, A., & Teneketzis, D. (2011). Optimal control strategies in delayed sharing information structures. IEEE Transactions on Automatic Control, 57(7), 1606–1620.

    Article  Google Scholar 

  • Pan, Z., & Başar, T. (1995). H control of Markovian jump systems and solutions to associated piecewise-deterministic differential games. In G. J. Olsder (Ed.), Annals of dynamic games (Vol. 2, pp. 61–94). Boston: Birkhäuser.

    Google Scholar 

  • Rhodes, I., & Luenberger, D. (1969). Differential games with imperfect state information. IEEE Transactions on Automatic Control, 14(1), 29–38.

    Article  Google Scholar 

  • Shi, L., Epstein, M., & Murray, R. M. (2010). Kalman filtering over a packet-dropping network: a probabilistic perspective. IEEE Transactions on Automatic Control, 55(1), 594–604.

    Google Scholar 

  • Willman, W. (1969). Formal solutions for a class of stochastic pursuit-evasion games. IEEE Transactions on Automatic Control, 14(5), 504–509.

    Article  Google Scholar 

  • Witsenhausen, H. (1968). A counterexample in stochastic optimum control. SIAM Journal on Control, 6, 131–147.

    Article  Google Scholar 

  • Witsenhausen, H. (1971a). Separation of estimation and control for discrete time systems. Proceedings of the IEEE, 59(11), 1557–1566.

    Article  Google Scholar 

  • Witsenhausen, H. (1971b). On information structures, feedback and causality. SIAM Journal on Control, 9(2), 149–160.

    Article  Google Scholar 

  • Yüksel, S., & Başar, T. (2013). Systems & control: foundations and applications series. Stochastic networked control systems: stabilization and optimization under information constraints. Boston: Birkhäuser.

    Book  Google Scholar 

Download references

Acknowledgements

This work was supported in part by the AFOSR MURI Grant FA9550-10-1-0573, and in part by NSF under grant number CCF 11-11342.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tamer Başar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Başar, T. (2014). Stochastic Differential Games and Intricacy of Information Structures. In: Haunschmied, J., Veliov, V., Wrzaczek, S. (eds) Dynamic Games in Economics. Dynamic Modeling and Econometrics in Economics and Finance, vol 16. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54248-0_2

Download citation

Publish with us

Policies and ethics