Abstract
An input generalization problem is one of the most important ones in applying reinforcement learning to real robot tasks. To cope with this problem, we propose a self-partitioning state space algorithm which can make non-uniform quantization of the multidimensional continuous state space. This method recursively splits its continuous state space into some coarse spaces called tentative states based on the relevance test for immediate reward r and discounted future reward Q which are collected during Q-learning process. When it finds out that a tentative state is relevant by the statistical test on a minimum description length (hereafter, MDL), it partitions this coarse space into finer spaces. To show that our algorithm has generalization capability, we apply our method to two tasks in which a soccer robot shoots a ball into a goal and prevent a ball from entering a goal. To show the validity of this method, the experimental results for computer simulation and a real robot are shown.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Chapman, D. and L. P. Kaelbling (1991). “Input generalization in delayed reinforcement learning: An alogorithm and performance comparisons”. In: Proc. of IJCAI-91. pp. 726–731.
Connel, J. H. and Mahadevan, S., Eds.) (1993). Robot Learning. Kluwer Academic Publishers.
Kaelbling, L. P. (1993). “Learning to achieve goals”. In: Proc. of IJCAI-93. pp. 1094–1098.
Moore, A. W. and C. G. Atkeson (1995). The Parti-game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State- spaces. Machine Learning 21, 199–233.
Nakamura, T. (1998(to appear)). “Development of Self-Learning Vision- Based Mobile Robots for Acquiring Soccer Robots Behaviors”. In: RoboCup-91: The First Robot World Cup Soccer Games and Conferences 1997 Springer-Verlag. pp. -.
Rissanen, J. (1989). Stochastic Complexity in Statistical Inquiry World Scientific.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nakamura, T. (1998). Development of Self-Learning Vision-Based Mobile Robots for Acquiring Soccer Robots Behaviors. In: Distributed Autonomous Robotic Systems 3. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-72198-4_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-72198-4_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-72200-4
Online ISBN: 978-3-642-72198-4
eBook Packages: Springer Book Archive