Skip to main content

No Regret Learning in Oligopolies: Cournot vs. Bertrand

  • Conference paper
Algorithmic Game Theory (SAGT 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6386))

Included in the following conference series:

Abstract

Cournot and Bertrand oligopolies constitute the two most prevalent models of firm competition. The analysis of Nash equilibria in each model reveals a unique prediction about the stable state of the system. Quite alarmingly, despite the similarities of the two models, their projections expose a stark dichotomy. Under the Cournot model, where firms compete by strategically managing their output quantity, firms enjoy positive profits as the resulting market prices exceed that of the marginal costs. On the contrary, the Bertrand model, in which firms compete on price, predicts that a duopoly is enough to push prices down to the marginal cost level. This suggestion that duopoly will result in perfect competition, is commonly referred to in the economics literature as the “Bertrand paradox”.

In this paper, we move away from the safe haven of Nash equilibria as we analyze these models in disequilibrium under minimal behavioral hypotheses. Specifically, we assume that firms adapt their strategies over time, so that in hindsight their average payoffs are not exceeded by any single deviating strategy. Given this no-regret guarantee, we show that in the case of Cournot oligopolies, the unique Nash equilibrium fully captures the emergent behavior. Notably, we prove that under natural assumptions the daily market characteristics converge to the unique Nash. In contrast, in the case of Bertrand oligopolies, a wide range of positive average payoff profiles can be sustained. Hence, under the assumption that firms have no-regret the Bertrand paradox is resolved and both models arrive to the same conclusion that increased competition is necessary in order to achieve perfect pricing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abernethy, J., Hazan, E., Rakhlin, A.: Competing in the dark: An efficient algorithm for bandit linear optimization. In: COLT (2008)

    Google Scholar 

  2. Amir, R.: Cournot oligopoly and the theory of supermodular games. Games and Economic Behavior (1996)

    Google Scholar 

  3. Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: The nonstochastic multiarmed bandit problem. SIAM Journal on Computing (2002)

    Google Scholar 

  4. Baye, M., Morgan, J.: A folk theorem for one-shot bertrand games. Economic Letters 65(1), 59–65 (1999)

    Article  MathSciNet  MATH  Google Scholar 

  5. Bertrand, J.: Theorie mathematique de la richesse sociale. Journal des Savants 67, 499–508 (1883)

    Google Scholar 

  6. Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning and Games. Cambridge University Press, Cambridge (2006)

    Book  MATH  Google Scholar 

  7. Cournot, A.A.: Recherches sur les principes mathmatiques de la thorie des richesses (1838)

    Google Scholar 

  8. Dufwenberg, M., Gneezy, U.: Price competition and market concentration: An experimental study. International Journal of Industrial Organization 18 (2000)

    Google Scholar 

  9. Even-Dar, E., Mansour, Y., Nadav, U.: On the convergence of regret minimization dynamics in concave games. In: 41st ACM Symposium on Theory of Computing, STOC (2009)

    Google Scholar 

  10. Flaxman, A., Kalai, A.T., McMahan, B.: Online convex optimization in the bandit setting: Gradient descent without a gradient. In: SODA (2005)

    Google Scholar 

  11. Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences 55(1), 119–139 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  12. Fudenberg, D., Levine, D.K.: The theory of learning in games. MIT Press, Cambridge (1998)

    MATH  Google Scholar 

  13. Hazan, E., Agarwal, A., Kale, S.: Logarithmic regret algorithms for online convex optimization. Machine Learning 69(2-3), 169–192 (2007)

    Article  MATH  Google Scholar 

  14. Liu, L.: Correlated equilibrium of cournot oligopoly competition. Journal of Economic Theory (1996)

    Google Scholar 

  15. Mas-Colell, A., Winston, M., Green, J.: Microeconomic Theory. Oxford University Press, Oxford (1995)

    MATH  Google Scholar 

  16. Milgrom, P., Roberts, J.: Rationalizability, learning, and equilibrium in games with strategic complementarities. Econometrica 58(6), 1255–1277 (1990)

    Article  MathSciNet  MATH  Google Scholar 

  17. Milgrom, P., Roberts, J.: Adaptive and sophisticated learning in repeated normal form games. In: Games and Economic Behavior, pp. 82–100 (February 1991)

    Google Scholar 

  18. Rosen, J.: Existense and uniqueness of equilibrium points for concave n-person games

    Google Scholar 

  19. Sheth, J.N., Sisodia, R.S.: The Rule of Three: Surviving and Thriving in Competitive Markets

    Google Scholar 

  20. Theocharis, R.D.: On the stability of the cournot solution on the oligopoly problem. The Review of Economic Studies 27(2), 133–134 (1960)

    Article  Google Scholar 

  21. Vives, X.: Oligopoly Pricing: Old Ideas and New Tools. The MIT Press, Cambridge (2001)

    Google Scholar 

  22. Wu, J.: Correlated equilibrium of bertrand competition. In: Papadimitriou, C., Zhang, S. (eds.) WINE 2008. LNCS, vol. 5385, pp. 166–177. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  23. Yi, S.-S.: On the existence of a unique correlated equilibrium in cournot oligopoly. Economic Letters 54, 235–239 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  24. Young, H.: Strategic Learning and Its Limits. Oxford University Press, Oxford (2004)

    Book  Google Scholar 

  25. Zinkevich, M.: Online convex programming and generalized infitesimal gradient ascent. In: ICML (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nadav, U., Piliouras, G. (2010). No Regret Learning in Oligopolies: Cournot vs. Bertrand. In: Kontogiannis, S., Koutsoupias, E., Spirakis, P.G. (eds) Algorithmic Game Theory. SAGT 2010. Lecture Notes in Computer Science, vol 6386. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16170-4_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16170-4_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16169-8

  • Online ISBN: 978-3-642-16170-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics