An Action Selection Method Based on Estimation of Other’s Intention in Time-Varying Multi-agent Environments

Kobayashi, Kunikazu; Kanehira, Ryu; Kuremoto, Takashi; Obayashi, Masanao

doi:10.1007/978-3-642-24965-5_9

Kunikazu Kobayashi¹⁸,
Ryu Kanehira¹⁸,
Takashi Kuremoto¹⁸ &
…
Masanao Obayashi¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7064))

Included in the following conference series:

International Conference on Neural Information Processing

2682 Accesses
3 Citations

Abstract

An action selection method based on the estimation of other’s intention is proposed to treat with time-varying multi-agent environments. Firstly, the estimation level of other’s intention is stratified as active, passive and thoughtful levels. Secondly, three estimation levels are formulated by a policy estimation method. Thirdly, a new action selection method by switching three estimation levels is proposed to cope with time-varying environments. Fourthly, the estimation methods of other’s intention are applied to the Q-learning method. Finally, through computer simulations using pursuit problems, the performance of the estimation methods are investigated. As a result, it is shown that the proposed method can select the appropriate estimation level in time-varying environments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Stone, P., Veloso, M.: Multiagent Systems: A Survey from a Machine Learning Perspective. Autonomous Robots 8(3), 345–383 (2000)
Article Google Scholar
Kaelbling, L.P., Littman, M.L., Moore, A.P.: Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press (1998)
Google Scholar
Bratman, M.E.: Intention, Plans and Practical Reason. Harvard University Press (1987)
Google Scholar
Nagayuki, Y., Ishii, S., Ito, M., Shimohara, K., Doya, K.: A Multi-Agent Reinforcement Learning Method with the Estimation of the Other Agent’s Actions. In: Proceedings of the Fifth International Symposium on Artificial Life and Robotics, vol. 1, pp. 255–259 (2000)
Google Scholar
Nagayuki, Y., Ito, M.: Reinforcement Learning Method with the Inference of the Other Agent’s Policy for 2-Player Stochastic Games. Transactions on the Institute of Electronics, Information and Communication Engineers J86-D-I(11), 821–829 (2003) (in Japanese)
Google Scholar
Watkins, C.J.C.H., Dayan, P.: Q-learning. Machine Learning 8(3-4), 279–292 (1992)
Article MATH Google Scholar
Yokoyama, A., Omori, T., Ishikawa, S., Okada, H.: Modeling of Action Decision Process Based on Intention Estimation. In: Proceedings of Joint 4th International Conference on Soft Computing and Intelligent Systems and 9th International Symposium on advanced Intelligent Systems, vol. TH-F3-1 (2008)
Google Scholar
Yokoyama, A., Omori, T.: Model Based Analysis of Action Decision Process in Collaborative Task Based on Intention Estimation. Transactions on the Institute of Electronics, Information and Communication Engineers J92-A(11), 734–742 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Yamaguchi University, Tokiwadai 2-16-1, Ube, Yamaguchi, 755-8611, Japan
Kunikazu Kobayashi, Ryu Kanehira, Takashi Kuremoto & Masanao Obayashi

Authors

Kunikazu Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Ryu Kanehira
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Kuremoto
View author publications
You can also search for this author in PubMed Google Scholar
Masanao Obayashi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiao Tong University, 800, Dongchuan Road, 200240, Shanghai, China
Bao-Liang Lu & Liqing Zhang &
Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
James Kwok

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kobayashi, K., Kanehira, R., Kuremoto, T., Obayashi, M. (2011). An Action Selection Method Based on Estimation of Other’s Intention in Time-Varying Multi-agent Environments. In: Lu, BL., Zhang, L., Kwok, J. (eds) Neural Information Processing. ICONIP 2011. Lecture Notes in Computer Science, vol 7064. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24965-5_9

Download citation

DOI: https://doi.org/10.1007/978-3-642-24965-5_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24964-8
Online ISBN: 978-3-642-24965-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics