No Regret Learning in Oligopolies: Cournot vs. Bertrand

Nadav, Uri; Piliouras, Georgios

doi:10.1007/978-3-642-16170-4_26

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6386)

Included in the following conference series:

International Symposium on Algorithmic Game Theory

Abstract

Cournot and Bertrand oligopolies constitute the two most prevalent models of firm competition. The analysis of Nash equilibria in each model reveals a unique prediction about the stable state of the system. Quite alarmingly, despite the similarities of the two models, their predictions expose a stark dichotomy. Under the Cournot model, where firms compete by strategically managing their output quantity, firms enjoy positive profits because the resulting market price exceeds marginal cost. In contrast, the Bertrand model, in which firms compete on price, predicts that a duopoly is enough to push prices down to the marginal cost level. This suggestion that a duopoly will result in perfect competition is commonly referred to in the economics literature as the “Bertrand paradox”.
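To make the dichotomy concrete, here is the standard textbook instance with linear inverse demand and constant marginal cost. The symbols a, b, c and n are illustrative assumptions and are not taken from the paper, whose setting may be more general.

```latex
% Assumed setting: n symmetric firms, inverse demand P(Q) = a - bQ with a > c >= 0, b > 0,
% and constant marginal cost c.
\begin{align*}
\text{Cournot:}\quad
  & q_i^{*} = \frac{a - c}{b\,(n+1)}, \qquad
    p^{*} = a - b\,n\,q_i^{*} = \frac{a + n c}{n + 1} > c, \qquad
    \pi_i^{*} = \frac{(a - c)^2}{b\,(n+1)^2} > 0, \\
\text{Bertrand } (n \ge 2):\quad
  & p^{*} = c, \qquad \pi_i^{*} = 0 .
\end{align*}
```

In this instance the Cournot price approaches c only as n grows, while the Bertrand price already equals c for n = 2.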

In this paper, we move away from the safe haven of Nash equilibria and analyze these models in disequilibrium under minimal behavioral hypotheses. Specifically, we assume that firms adapt their strategies over time so that, in hindsight, their average payoffs are not exceeded by those of any single deviating strategy. Given this no-regret guarantee, we show that in the case of Cournot oligopolies, the unique Nash equilibrium fully captures the emergent behavior. Notably, we prove that under natural assumptions the daily market characteristics converge to the unique Nash equilibrium. In contrast, in the case of Bertrand oligopolies, a wide range of positive average payoff profiles can be sustained. Hence, under the assumption that firms have no regret, the Bertrand paradox is resolved and both models arrive at the same conclusion: increased competition is necessary in order to achieve perfect pricing.
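The Cournot claim can be illustrated with a small simulation. The sketch below is not the paper's construction: it assumes a duopoly with linear inverse demand P(Q) = a - bQ and constant marginal cost c, and it lets each firm run online projected gradient ascent with step size eta0/sqrt(t), one standard no-regret algorithm for concave payoffs. The function name `cournot_gradient_play` and all parameter values are hypothetical.

```python
import math


def cournot_gradient_play(a=10.0, b=1.0, c=1.0, rounds=2000, eta0=0.5,
                          start=(0.5, 8.0)):
    """Simulate firms that update output by online projected gradient ascent.

    Assumed model (illustrative, not from the paper): inverse demand
    P(Q) = a - b*Q and constant marginal cost c for every firm.
    Returns the final quantities and the final market price.
    """
    q_max = a / b  # output above this level drives the price to zero
    q = list(start)
    for t in range(1, rounds + 1):
        eta = eta0 / math.sqrt(t)  # decaying step size
        total = sum(q)
        # Profit of firm i is q_i * (a - b*Q - c); its derivative in q_i
        # is a - c - b*Q - b*q_i, evaluated at the current quantities.
        grads = [a - c - b * total - b * qi for qi in q]
        # Simultaneous update, then projection back onto [0, q_max].
        q = [min(max(qi + eta * g, 0.0), q_max) for qi, g in zip(q, grads)]
    price = a - b * sum(q)
    return q, price


if __name__ == "__main__":
    quantities, price = cournot_gradient_play()
    print("final quantities:", quantities)
    print("final price:", price)
```

For the default values the Cournot-Nash quantity is (a - c)/(3b) = 3 per firm, with price 4 against a marginal cost of 1, so the simulated day-to-day quantities can be compared directly with that benchmark. A Bertrand counterpart is deliberately not sketched here, since the abstract states only that a wide range of positive payoff profiles can be sustained, without describing a specific dynamic.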

Author information

Authors and Affiliations

Department of Computer Science, Stanford University, Stanford, CA, 94305
Uri Nadav
Department of Computer Science, Cornell University, Ithaca, NY, 14853
Georgios Piliouras

Editor information

Editors and Affiliations

Lecturer, Department of Computer Science, University of Ioannina, Dourouti University Campus, 45110, Ioannina, Greece
Spyros Kontogiannis
Professor, Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Panepistimiopolis, Ilissia, 15784, Athens, Greece
Elias Koutsoupias
Computer Engineering and Informatics Department (CEID), University of Patras, 26500, Patras, Greece
Paul G. Spirakis

Cite this paper

Nadav, U., Piliouras, G. (2010). No Regret Learning in Oligopolies: Cournot vs. Bertrand. In: Kontogiannis, S., Koutsoupias, E., Spirakis, P.G. (eds) Algorithmic Game Theory. SAGT 2010. Lecture Notes in Computer Science, vol 6386. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16170-4_26

DOI: https://doi.org/10.1007/978-3-642-16170-4_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16169-8
Online ISBN: 978-3-642-16170-4
eBook Packages: Computer Science, Computer Science (R0)
