Abstract
In reinforcement learning with limited exploration, an agent's policy tends to fall into a worthless local optimum. This paper proposes the Observational Reinforcement Learning method, with which the learning agent evaluates inexperienced policies and reinforces them. The method gives the agent more chances to escape from a local optimum without additional exploration. Moreover, this paper demonstrates the effectiveness of the method through experiments on the RoboCup positioning problem, which extend the experiments described in our RoboCup-97 paper [1].
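The core idea — reinforcing a policy the agent never executed, using an evaluation obtained by observation — can be illustrated with a minimal tabular sketch. This is not the paper's actual formulation: the class, the `observe` method, and the `estimated_return` argument are hypothetical stand-ins for however the agent derives a value estimate from watching the environment.

```python
from collections import defaultdict

class ObservationalQLearner:
    """Hedged sketch: tabular Q-learning plus an 'observational' update
    that reinforces actions the agent has never tried itself."""

    def __init__(self, actions, alpha=0.1, gamma=0.9):
        self.q = defaultdict(float)  # (state, action) -> value, default 0
        self.actions = actions
        self.alpha = alpha           # learning rate
        self.gamma = gamma           # discount factor

    def greedy(self, state):
        # Exploit-only action choice; with no exploration, the agent can
        # get stuck on whichever action first looks best.
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state):
        # Standard Q-learning update for the action actually executed.
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        td_target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])

    def observe(self, state, unexperienced_action, estimated_return):
        # Observational update (assumption): pull the value of an action
        # the agent never took toward an estimate gained by observation,
        # opening an escape route from a local optimum without exploring.
        self.q[(state, unexperienced_action)] += self.alpha * (
            estimated_return - self.q[(state, unexperienced_action)]
        )
```

Under this sketch, an observed high-value positioning (say, a teammate's) can raise the value of an untried action above the current greedy one, so the exploitation-only policy switches without the agent ever having explored that action itself.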
This work was mainly done while the author was with the Dept. of Mathematical and Computing Sciences, Tokyo Institute of Technology.
References
Andou, T.: “Refinement of Soccer Agents’ Positions Using Reinforcement Learning”, In RoboCup-97: Robot Soccer World Cup I, pp.373–388 (1998).
Kaelbling, L. P.: “Reinforcement Learning: A Survey”, Journal of Artificial Intelligence Research 4, pp.237–285 (1996).
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
Cite this paper
Andou, T. (1999). Andhill-98: A RoboCup Team which Reinforces Positioning with Observation. In: Asada, M., Kitano, H. (eds) RoboCup-98: Robot Soccer World Cup II. RoboCup 1998. Lecture Notes in Computer Science(), vol 1604. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48422-1_27
Print ISBN: 978-3-540-66320-1
Online ISBN: 978-3-540-48422-6