Reinforcement Learning Approaches to Coordination in Cooperative Multi-agent Systems

Kapetanakis, Spiros; Kudenko, Daniel; Strens, Malcolm J. A.

doi:10.1007/3-540-44826-8_2

Spiros Kapetanakis³,
Daniel Kudenko³ &
Malcolm J. A. Strens⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2636))

Included in the following conference series:

625 Accesses
19 Citations

Abstract

We report on an investigation of reinforcement learning techniques for the learning of coordination in cooperative multi-agent systems. Specifically, we focus on two novel approaches: one is based on a new action selection strategy for Q-learning [10], and the other is based on model estimation with a shared action-selection protocol. The new techniques are applicable to scenarios where mutual observation of actions is not possible.

To date, reinforcement learning approaches for such independent agents did not guarantee convergence to the optimal joint action in scenarios with high miscoordination costs. We improve on previous results [2] by demonstrating empirically that our extension causes the agents to converge almost always to the optimal joint action even in these difficult cases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

C. Boutilier. Sequential optimality and coordination in multiagent systems. In Proceedings of the Sixteenth International Joint Conference on Articial Intelligence (IJCAI-99), pages 478–485, 1999.
Google Scholar
Caroline Claus and Craig Boutilier. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the Fifteenth National Conference on Articial Intelligence, pages 746–752, 1998.
Google Scholar
Drew Fudenberg and David K. Levine. The Theory of Learning in Games. MIT Press, Cambridge, MA, 1998.
MATH Google Scholar
Leslie Pack Kaelbling, Michael Littman, and Andrew W. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 1996.
Google Scholar
Martin Lauer and Martin Riedmiller. An algorithm for distributed reinforcement learning in cooperative multi-agent systems. In Proceedings of the Seventeenth International Conference in Machine Learning, 2000.
Google Scholar
Sandip Sen and Mahendra Sekaran. Individual learning of coordination knowledge. JETAI, 10(3): 333–356, 1998.
MATH Google Scholar
Sandip Sen, Mahendra Sekaran, and John Hale. Learning to coordinate without sharing information. In Proceedings of the Twelfth National Conference on Artificial Intelligence, pages 426–431, Seattle, WA, 1994.
Google Scholar
S. Singh, T. Jaakkola, M. L. Littman, and C Szpesvari. Convergence results for single-step on-policy reinforcement-learning algorithms. Machine Learning Journal, 38(3):287–308, 2000.
Article MATH Google Scholar
Ming Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning, pages 330–337, 1993.
Google Scholar
C. J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, Cambridge University, Cambridge, England, 1989.
Google Scholar
Gerhard Weiss. Learning to coordinate actions in multi-agent systems. In Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence, volume 1, pages 311–316. Morgan Kaufmann Publ., 1993.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of York, Heslington, York, YO10 5DD, UK
Spiros Kapetanakis & Daniel Kudenko
Guidance and Imaging Solutions, QinetiQ, Ively Road, Farnborough, Hampshire, GU14 OLX, UK
Malcolm J. A. Strens

Authors

Spiros Kapetanakis
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Kudenko
View author publications
You can also search for this author in PubMed Google Scholar
Malcolm J. A. Strens
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computing, City University, London, EC1V 0HB, UK
Eduardo Alonso
Department of Computer Science, University of York, Heslington, York, YO10 5DD, UK
Daniel Kudenko & Dimitar Kazakov &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kapetanakis, S., Kudenko, D., Strens, M.J.A. (2003). Reinforcement Learning Approaches to Coordination in Cooperative Multi-agent Systems. In: Alonso, E., Kudenko, D., Kazakov, D. (eds) Adaptive Agents and Multi-Agent Systems. AAMAS AAMAS 2002 2001. Lecture Notes in Computer Science(), vol 2636. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44826-8_2

Download citation

DOI: https://doi.org/10.1007/3-540-44826-8_2
Published: 13 May 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40068-4
Online ISBN: 978-3-540-44826-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics