Abstract
Conventional reinforcement learning approaches have difficulty coping with changes in an opponent's policy, because such changes alter the state transition probabilities whose stability is necessary for learning to converge. This paper presents a multi-module reinforcement learning method for a multiagent environment, by which the learning agent can adapt itself to the policy changes of the opponents. We show a preliminary result in a simple soccer situation in the context of RoboCup.
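The abstract describes a learner that switches among multiple modules as the opponent's policy (and hence the transition dynamics) changes. A minimal sketch of that idea, assuming a MOSAIC-style pairing of one transition predictor with one Q-table per module; the class names, window size, and product-of-likelihoods selection rule are illustrative assumptions, not the paper's actual architecture:

```python
from collections import defaultdict, deque

N_STATES, N_ACTIONS = 4, 2  # toy sizes for illustration

class Module:
    """One module: a count-based transition predictor plus a Q-table."""
    def __init__(self, alpha=0.2, gamma=0.9):
        self.alpha, self.gamma = alpha, gamma
        self.q = defaultdict(float)                          # (s, a) -> value
        self.counts = defaultdict(lambda: defaultdict(int))  # (s, a) -> {s': n}

    def predict_prob(self, s, a, s2):
        """Estimated P(s' | s, a) from counts, uniform when unseen."""
        c = self.counts[(s, a)]
        total = sum(c.values())
        return c[s2] / total if total else 1.0 / N_STATES

    def update(self, s, a, r, s2):
        """Record the transition and do one Q-learning backup."""
        self.counts[(s, a)][s2] += 1
        best = max(self.q[(s2, b)] for b in range(N_ACTIONS))
        self.q[(s, a)] += self.alpha * (r + self.gamma * best - self.q[(s, a)])

class MultiModuleAgent:
    """Acts with the module whose predictor best explains recent transitions."""
    def __init__(self, n_modules=2, window=10):
        self.modules = [Module() for _ in range(n_modules)]
        self.recent = deque(maxlen=window)  # recent (s, a, s') triples

    def responsibility(self, m):
        # Likelihood of the recent window under module m's predictor.
        p = 1.0
        for s, a, s2 in self.recent:
            p *= m.predict_prob(s, a, s2)
        return p

    def select(self):
        return max(self.modules, key=self.responsibility)

    def observe(self, s, a, r, s2):
        # Only the currently responsible module is trained, so each
        # module specializes to one opponent policy.
        self.recent.append((s, a, s2))
        self.select().update(s, a, r, s2)
```

When the opponent switches policy, the recent window stops matching the current module's predictor and `select()` hands control to a better-fitting module, instead of forcing one monolithic learner to relearn the changed dynamics.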
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
Cite this paper
Takahashi, Y., Edazawa, K., Asada, M. (2003). Behavior Acquisition Based on Multi-module Learning System in Multi-agent Environment. In: Kaminka, G.A., Lima, P.U., Rojas, R. (eds) RoboCup 2002: Robot Soccer World Cup VI. RoboCup 2002. Lecture Notes in Computer Science(), vol 2752. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45135-8_39
DOI: https://doi.org/10.1007/978-3-540-45135-8_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40666-2
Online ISBN: 978-3-540-45135-8
eBook Packages: Springer Book Archive