On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam’s Razor

Tuyls, Karl; Verbeeck, Katja; Maes, Sam

doi:10.1007/3-540-45023-8_32

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2691)

Included in the following conference series:

International Central and Eastern European Conference on Multi-Agent Systems

Abstract

Modeling learning agents in the context of Multi-agent Systems requires an adequate understanding of their dynamic behaviour. Usually, these agents are modeled similarly to the different players in a standard game-theoretical model. Unfortunately, traditional Game Theory is static and limited in its usefulness.

Evolutionary Game Theory improves on this by providing a dynamics which describes how strategies evolve over time. In this paper, we discuss three learning models whose dynamics are related to the Replicator Dynamics (RD). We show how a classical Reinforcement Learning (RL) technique, namely Q-learning, relates to the RD. This allows us to better understand the learning process and to determine how complex an RL model should be. More precisely, Occam’s Razor applies in the framework of games: the simplest model (Cross) suffices for learning equilibria. An experimental verification in all three models is presented.
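
The link between the Replicator Dynamics and the simplest of the learning models (Cross) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the standard two-population replicator equations and the Cross update rule as usually stated in the literature, with rewards in [0, 1]. The function names, the payoff matrices (a Prisoner's Dilemma rescaled to [0, 1]), the Euler step `dt`, and the reward scaling `theta` are illustrative assumptions, not taken from the paper.

```python
import random


def replicator_step(x, y, A, B, dt=0.01):
    """One Euler step of the two-population replicator dynamics.

    x, y: mixed strategies of the row and column player.
    A[i][j], B[i][j]: payoffs of the row and column player when the
    row player picks action i and the column player picks action j.
    """
    n, m = len(x), len(y)
    fx = [sum(A[i][j] * y[j] for j in range(m)) for i in range(n)]
    fy = [sum(B[i][j] * x[i] for i in range(n)) for j in range(m)]
    avg_x = sum(x[i] * fx[i] for i in range(n))
    avg_y = sum(y[j] * fy[j] for j in range(m))
    # An action grows in proportion to its payoff advantage over the average.
    new_x = [x[i] + dt * x[i] * (fx[i] - avg_x) for i in range(n)]
    new_y = [y[j] + dt * y[j] * (fy[j] - avg_y) for j in range(m)]
    return new_x, new_y


def cross_update(p, action, reward):
    """Cross learning update for one agent; reward must lie in [0, 1].

    The played action moves towards probability 1 by a fraction `reward`;
    all other actions shrink by the same fraction, so p still sums to 1.
    """
    return [p_k + reward * (1.0 - p_k) if k == action else p_k - reward * p_k
            for k, p_k in enumerate(p)]


def simulate_cross(A, B, rounds=5000, theta=0.05, seed=0):
    """Two Cross learners playing the game (A, B) repeatedly.

    theta is an assumed scaling of the reward that keeps updates small.
    """
    rng = random.Random(seed)
    p = [1.0 / len(A)] * len(A)
    q = [1.0 / len(A[0])] * len(A[0])
    for _ in range(rounds):
        i = rng.choices(range(len(p)), weights=p)[0]
        j = rng.choices(range(len(q)), weights=q)[0]
        p = cross_update(p, i, theta * A[i][j])
        q = cross_update(q, j, theta * B[i][j])
    return p, q


# Assumed example game: Prisoner's Dilemma with payoffs rescaled to [0, 1].
# Action 0 = cooperate, action 1 = defect.
A = [[0.6, 0.0],
     [1.0, 0.2]]
B = [[A[j][i] for j in range(2)] for i in range(2)]  # symmetric game

if __name__ == "__main__":
    x, y = [0.5, 0.5], [0.5, 0.5]
    for _ in range(5000):
        x, y = replicator_step(x, y, A, B)
    print("replicator dynamics:", x, y)
    print("Cross learning:     ", simulate_cross(A, B))
```

With small updates, the expected change of the Cross learner's action probabilities has the same form as the replicator equations, which is why the two trajectories can be compared on the same game.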

Author funded by a doctoral grant of the institute for advancement of scientific technological research in Flanders (IWT).

Author information

Authors and Affiliations

Department of Computer Science, CoMo, VUB, Belgium
Karl Tuyls, Katja Verbeeck & Sam Maes

Editor information

Editors and Affiliations

Faculty of Electrical Engineering, Dept. of Cybernetics, Czech Technical University, 16627, Praha 6, Czech Republic
Vladimír Mařík & Michal Pěchouček
Siemens AG, CT, IC 6, Otto-Hahn-Ring 6, 81730, München, Germany
Jörg Müller

About this paper

Cite this paper

Tuyls, K., Verbeeck, K., Maes, S. (2003). On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam’s Razor. In: Mařík, V., Pěchouček, M., Müller, J. (eds) Multi-Agent Systems and Applications III. CEEMAS 2003. Lecture Notes in Computer Science, vol 2691. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45023-8_32

DOI: https://doi.org/10.1007/3-540-45023-8_32
Published: 27 May 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40450-7
Online ISBN: 978-3-540-45023-8
eBook Packages: Springer Book Archive
