Multistage Games

Abstract

In this chapter, we build on the concept of a repeated game and introduce the notion of a multistage game. In both types of games, several antagonistic agents interact with each other over time. The difference is that, in a multistage game, there is a dynamic system whose state keeps changing: the controls chosen by the agents in the current period affect the system's future evolution. In contrast with repeated games, the agents' payoffs in multistage games depend directly on the state of this system. Examples of such settings range from a microeconomic dynamic model of a fish biomass exploited by several agents to a macroeconomic interaction between the government and the business sector. In some multistage games, physically distinct decision-makers engage in simultaneous-move competition; in others, agents execute their actions sequentially rather than simultaneously. We also study hierarchical games, where a leader moves ahead of a follower. The chapter concludes with an example of memory-based strategies that can support Pareto-efficient outcomes.
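
The fish-biomass example mentioned above has a classic closed-form instance: the "great fish war" of Levhari and Mirman (1980), listed in the references. The sketch below (Python, not part of the chapter; all numbers are illustrative assumptions) simulates the feedback-Nash harvesting rule usually reported for two players with logarithmic utility and stock dynamics x_{t+1} = (x_t − c_1 − c_2)^α.

```python
# A minimal sketch of a multistage game: the "great fish war" of
# Levhari and Mirman (1980).  Two agents harvest a common fish stock x_t
# that regrows as x_{t+1} = s_t**alpha, where s_t is the escapement
# (stock left after harvesting).  The feedback rule
# c_j = (1 - alpha*beta)/(2 - alpha*beta) * x is the closed-form
# equilibrium usually quoted for log utility; the parameter values are
# illustrative assumptions, not taken from the chapter.

alpha, beta = 0.5, 0.9                            # growth elasticity, discount factor
rate = (1 - alpha * beta) / (2 - alpha * beta)    # each player's harvest share

def step(x):
    """One stage: both players harvest, then the stock regrows."""
    harvest_each = rate * x
    escapement = x - 2 * harvest_each
    return escapement ** alpha

x = 1.0
for t in range(50):
    x = step(x)

# the steady state solves x = ((alpha*beta/(2 - alpha*beta)) * x)**alpha
x_star = (alpha * beta / (2 - alpha * beta)) ** (alpha / (1 - alpha))
print(round(x, 6), round(x_star, 6))
```

Because each player removes a fixed fraction of the current stock, the state equation contracts toward the steady state, illustrating how current controls shape the system's future, the defining feature of a multistage game.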


Notes

  1.

    In the sections dealing with dynamic systems described by multiple state equations, we adopt a notation in which vectors and matrices are set in boldface to distinguish them from scalars, which are set in regular style.

  2.

    In stochastic systems, some “controls” may come from nature and are thus independent of other players’ actions.

  3.

    We should also note that, when the time horizon is infinite, it is usually assumed that the system is stationary. That is, the reward and state-transition functions do not depend explicitly on time t.

  4.

    This limit is known as the Cesàro limit.
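
A hypothetical numerical illustration of the Cesàro limit (not from the chapter): in an undiscounted infinite-horizon game, the long-run average payoff lim_{T→∞} (1/T) Σ_{t=0}^{T-1} g_t can exist even when the reward sequence itself does not converge.

```python
# The Cesàro limit of a reward stream: the limit of its running averages.
# An alternating stream 1, 0, 1, 0, ... has no ordinary limit, but its
# Cesàro averages converge to 1/2.

def cesaro_averages(rewards):
    """Return the sequence of running averages (1/T) * sum of the first T rewards."""
    total = 0.0
    averages = []
    for t, g in enumerate(rewards, start=1):
        total += g
        averages.append(total / t)
    return averages

rewards = [1.0 if t % 2 == 0 else 0.0 for t in range(10000)]
avg = cesaro_averages(rewards)
print(avg[-1])   # the running average settles at 0.5
```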

  5.

    In the most general formulation of the problem, the control constraint sets may depend on time and the current state, i.e., \(U_{j}(t,x)\subset \mathbb {R}^{m_{j}}\). Moreover, sometimes the state may be constrained to remain in a subset \(\mathbf {X}\subset \mathbb {R} ^{n}\). We avoid these complications here.

  6.

    For example, the anti-lock braking systems (ABS) used in cars, the automatic landing systems of aircraft, etc.

  7.

    Commitments (agreements, treaties, schedules, planning processes, etc.) may force the agents to use open-loop controls even if state observations are available. On the other hand, some state variables (like the quantity of fish biomass in a management model for an ocean fishery) cannot be easily observed. In such cases, the agents may try to establish feedback controls using proxy variables, e.g., fish prices on a particular market.
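
The open-loop/feedback distinction in this note can be sketched on a toy scalar system (entirely hypothetical, not the chapter's model): an open-loop control is a sequence committed to in advance, while a feedback control is a function of the observed state and thus corrects for shocks.

```python
import random

# Toy system x_{t+1} = x_t + u_t + w_t with small shocks w_t (an assumed
# example).  The open-loop plan is fixed before play; the feedback law
# u_t = -x_t reacts to whatever state is actually observed.

random.seed(0)
T, x0 = 20, 1.0
shocks = [random.gauss(0.0, 0.05) for _ in range(T)]

# Open-loop: drive the *nominal* (shock-free) state to zero in one step,
# then plan zero controls forever.
open_loop_plan = [-x0] + [0.0] * (T - 1)

def run(use_feedback):
    x = x0
    for t in range(T):
        u = -x if use_feedback else open_loop_plan[t]
        x = x + u + shocks[t]
    return x

x_ol = run(False)   # plan never reacts: the final state is the sum of all shocks
x_fb = run(True)    # each period corrects the state: only the last shock remains
print(abs(x_ol), abs(x_fb))
```

The algebra is exact here: under feedback the final state equals the last shock alone, while under open loop every shock accumulates, which is why commitment to an open-loop control can be costly when the state is observable.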

  8.

    An information structure of this type is known as piecewise open-loop control.

  9.

    Also see equations (4.35)–(4.36) later in the chapter.

  10.

    We note that expressing payoffs as functions of players' strategies is necessary for defining a game in normal form.

  11.

    To simplify notation, from now on we will omit the superscript T and refer to \(\tilde {\mathbf {u}}_j\) instead of \(\tilde {\mathbf {u}}_j^T\) or \(\tilde {\mathbf {x}}\) instead of \(\tilde {\mathbf {x}}^T\).

  12.

    Also called the adjoint vector. This terminology is borrowed from optimal control theory.

  13.

    If a feedback strategy pair \( \underline {{\boldsymbol {\sigma }}}(t,\mathbf {x})\) is continuous in t and its partial derivatives \(\frac {\partial }{\partial \mathbf {x}} \underline {{\boldsymbol {\sigma }}}(t,\mathbf {x})\) exist and are continuous, then it is possible to characterize a feedback-Nash equilibrium through a coupled maximum principle (see Haurie et al. 2012).

  14.

    For notational simplicity, we still use \(J_{j}\) to designate this game's normal-form payoffs.

  15.

    In a stochastic context, perhaps counterintuitively, certain multistage (supermodular) games defined on lattices admit feedback equilibria, which can be established via a fixed-point theorem due to Tarski. See Haurie et al. (2012) and the references provided there.

  16.

    For this reason, it is also sometimes referred to as strong time consistency; see Başar and Olsder (1999) and Başar (1989).

  17.

    This notation helps generalize our results. They would formally be unchanged if there were m > 2 players. In that case, − j would refer to the m − 1 opponents of Player j.

  18.

    We note that the functions \(W_{j}^*(\cdots )\) and \(W_{-j}^*(\cdots )\) are continuation payoffs. Compare Sect. 6.1.

  19.

    Recall that an equilibrium is a fixed point of a best-reply function, and that a fixed point requires some regularity to exist.
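
As a minimal illustration of this note (assuming a standard linear Cournot duopoly, not a model from the chapter), an equilibrium is precisely a fixed point of the best-reply map, and here iterating that map converges to it:

```python
# Static linear Cournot duopoly: inverse demand p = a - (q1 + q2),
# constant unit cost c.  Player j's best reply to the opponent's output
# is BR(q_opp) = (a - c - q_opp) / 2, and the Nash equilibrium, the fixed
# point q = BR(q), is q* = (a - c) / 3.  Parameter values are assumptions.

a, c = 10.0, 1.0

def best_reply(q_opponent):
    return max(0.0, (a - c - q_opponent) / 2.0)

q1 = q2 = 0.0
for _ in range(100):                       # simultaneous best-reply dynamics
    q1, q2 = best_reply(q2), best_reply(q1)

q_nash = (a - c) / 3.0
print(round(q1, 6), round(q2, 6), round(q_nash, 6))
```

The iteration converges because the best-reply map has slope −1/2, a mild regularity of the kind the note alludes to; a fixed point need not exist, or be reachable this way, without such conditions.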

  20.

    The satisfaction of these conditions guarantees that such an equilibrium is a feedback-Nash, or Markovian, equilibrium.

  21.

    We observe that a feedback-Nash equilibrium is also a Nash equilibrium for the normal (strategic) form of the game. In that case it is not defined recursively, but by using state-feedback control laws as strategies; see Başar and Olsder (1999) and Başar (1989).

  22.

    If the local games at every stage are diagonally strictly concave (see, e.g., Krawczyk and Tidball 2006), then a unique equilibrium \( \underline {{\sigma }}(t,\mathbf {x})\equiv (\sigma _{j}(t, \mathbf {x}(t)),\sigma _{-j}(t,\mathbf {x}(t)))\) is guaranteed to exist. However, it turns out that diagonal strict concavity of the game at t does not generally imply that the game at t − 1 possesses this feature.

  23.

    The system's dynamics with a linear law of motion are an example of a BIDE model (Birth, Immigration, Death, Emigration model; see, e.g., Fahrig 2002 and Pulliam 1988).

  24.

    To solve a functional equation like (4.60), we use the method of undetermined coefficients; see Haurie et al. (2012) or any textbook on difference equations.
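
As a sketch of the method (on a hypothetical single-agent log-utility AK problem, not equation (4.60) itself), one guesses the functional form of the value function, matches coefficients, and can then check the resulting Bellman equation numerically:

```python
import math

# Hypothetical Bellman equation (an assumed example, not the chapter's):
#   V(x) = max_u { ln u + delta * V(a*x - u) },   x_{t+1} = a*x_t - u_t.
# Guess V(x) = A + B*ln x.  Matching the coefficient of ln x gives
#   B = 1/(1 - delta),  with optimal policy u*(x) = (1 - delta)*a*x,
# and matching the constants gives
#   A = ( ln((1-delta)*a) + (delta/(1-delta))*ln(delta*a) ) / (1 - delta).

a, delta = 1.2, 0.9
B = 1.0 / (1.0 - delta)
A = (math.log((1 - delta) * a)
     + (delta / (1 - delta)) * math.log(delta * a)) / (1 - delta)

def V(x):
    return A + B * math.log(x)

def bellman_rhs(x, grid=20000):
    """Brute-force maximization of ln u + delta*V(a*x - u) over u in (0, a*x)."""
    best = -float("inf")
    for k in range(1, grid):
        u = a * x * k / grid
        best = max(best, math.log(u) + delta * V(a * x - u))
    return best

for x in (0.5, 1.0, 3.0):
    print(x, round(V(x) - bellman_rhs(x), 4))   # residual rounds to 0.0
```

The grid maximum can only fall slightly short of the true maximum, so a residual near zero at every tested state confirms that the guessed coefficients solve the functional equation.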

  25.

    Any linear growth model in which capital expands in proportion to its current level, with growth coefficient a, is called an AK model.

  26.

    Alternatively, we could postulate that the cost of adjusting output in the subsequent period is infinite.

  27.

    The announced strategy is supposed to be implemented; the leader could, however, deviate from it.

  28.

    That is, with a finite number of states. While games described by a state equation are typically infinite, matrix games are always finite.

  29.

    Source: DeFreitas and Marshall (1998).

  30.

    In fact, the coefficients \(\underline {c}\), \( \underline {w}\), and Γ will all depend on α. However, to simplify notation, we will keep these symbols nonindexed.

  31.

    Here, all \({ \underline {c}}^{\alpha },{ \underline {w}}^{\alpha },\Gamma _{c},\Gamma _{w}\) depend on β and α; see pages 203 and 206.

References

  • Barro RJ (1999) Ramsey meets Laibson in the neoclassical growth model. Q J Econ 114: 1125–1152

  • Başar T (1989) Time consistency and robustness of equilibria in noncooperative dynamic games. In: van der Ploeg F, de Zeeuw AJ (eds) Dynamic policy games in economics. North Holland, Amsterdam

  • Başar T, Olsder GJ (1982) Dynamic noncooperative game theory. Academic Press, New York

  • Başar T, Olsder GJ (1999) Dynamic noncooperative game theory. SIAM series in Classics in Applied Mathematics. SIAM, Philadelphia

  • Bellman R (1957) Dynamic programming. Princeton University Press, Princeton

  • Clark CW (1976) Mathematical bioeconomics. Wiley-Interscience, New York

  • DeFreitas G, Marshall A (1998) Labour surplus, worker rights and productivity growth: a comparative analysis of Asia and Latin America. Labour 12(3):515–539

  • Diamond P, Köszegi B (2003) Quasi-hyperbolic discounting and retirement. J Public Econ 87(9):1839–1872

  • Fahrig L (2002) Effect of habitat fragmentation on the extinction threshold: a synthesis. Ecol Appl 12(2):346–353

  • Fan LT, Wang CS (1964) The discrete maximum principle. John Wiley and Sons, New York

  • Fudenberg D, Tirole J (1991) Game theory. The MIT Press, Cambridge/London

  • Gruber J, Koszegi B (2001) Is addiction rational? Theory and evidence. Q J Econ 116(4):1261

  • Halkin H (1974) Necessary conditions for optimal control problems with infinite horizons. Econom J Econom Soc 42:267–272

  • Haurie A, Krawczyk JB, Zaccour G (2012) Games and dynamic games. Business series, vol 1. World Scientific/Now Publishers, Singapore/Hackensack

  • Kaldor N (1961) Capital accumulation and economic growth. In: Lutz F, Hague DC (eds) Proceedings of a conference held by the international economics association. McMillan, London, pp 177–222

  • Kocherlakota NR (2001) Looking for evidence of time-inconsistent preferences in asset market data. Fed Reserve Bank Minneap Q Rev 25(3):13

  • Krawczyk JB, Shimomura K (2003) Why countries with the same technology and preferences can have different growth rates. J Econ Dyn Control 27(10):1899–1916

  • Krawczyk JB, Tidball M (2006) A discrete-time dynamic game of seasonal water allocation. J Optim Theory Appl 128(2):411–429

  • Laibson DI (1996) Hyperbolic discount functions, undersaving, and savings policy. Technical report, National Bureau of Economic Research

  • Laibson D (1997) Golden eggs and hyperbolic discounting. Q J Econ 112(2):443–477

  • Levhari D, Mirman LJ (1980) The great fish war: an example using a dynamic Cournot-Nash solution. Bell J Econ 11(1):322–334

  • Long NV (1977) Optimal exploitation and replenishment of a natural resource. In: Pitchford J, Turnovsky S (eds) Applications of control theory in economic analysis. North-Holland, Amsterdam, pp 81–106

  • Luenberger DG (1969) Optimization by vector space methods. John Wiley & Sons, New York

  • Luenberger DG (1979) Introduction to dynamic systems: theory, models & applications. John Wiley & Sons, New York

  • Maskin E, Tirole J (1987) A theory of dynamic oligopoly, iii: Cournot competition. Eur Econ Rev 31(4):947–968

  • Michel P (1982) On the transversality condition in infinite horizon optimal problems. Econometrica 50:975–985

  • O’Donoghue T, Rabin M (1999) Doing it now or later. Am Econ Rev 89(1):103–124

  • Paserman MD (2008) Job search and hyperbolic discounting: structural estimation and policy evaluation. Econ J 118(531):1418–1452

  • Phelps ES, Pollak RA (1968) On second-best national saving and game-equilibrium growth. Rev Econ Stud 35(2):185–199

  • Pulliam HR (1988) Sources, sinks, and population regulation. Am Nat 652–661

  • Romer PM (1986) Increasing returns and long-run growth. J Polit Econ 94(5):1002–1037

  • Samuelson PA (1937) A note on measurement of utility. Rev Econ Stud 4(2):155–161

  • Selten R (1975) Reexamination of the perfectness concept for equilibrium points in extensive games. Int J Game Theory 4(1):25–55

  • Simaan M, Cruz JB (1973) Additional aspects of the Stackelberg strategy in nonzero-sum games. J Optim Theory Appl 11:613–626

  • Spencer BJ, Brander JA (1983a) International R&D rivalry and industrial strategy. Working Paper 1192, National Bureau of Economic Research, http://www.nber.org/papers/w1192

  • Spencer BJ, Brander JA (1983b) Strategic commitment with R&D: the symmetric case. Bell J Econ 14(1):225–235

  • Strotz RH (1955) Myopia and inconsistency in dynamic utility maximization. Rev Econ Stud 23(3):165–180

Author information

Correspondence to Jacek B. Krawczyk.

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

Cite this entry

Krawczyk, J.B., Petkov, V. (2018). Multistage Games. In: Başar, T., Zaccour, G. (eds) Handbook of Dynamic Game Theory. Springer, Cham. https://doi.org/10.1007/978-3-319-44374-4_3
