Control of a Free-Falling Cat by Policy-Based Reinforcement Learning
Autonomous control of nonholonomic systems is one big challenge, because there is no unified control method that can handle any nonholonomic systems even if the dynamics are known. To this challenge, in this study, we propose a reinforcement learning (RL) approach which enables the controller to acquire an appropriate control policy even without knowing the detailed dynamics. In particular, we focus on the control problem of a free-falling cat system whose dynamics are highly-nonlinear and nonholonomic. To accelerate the learning, we take the policy gradient method that exploits the basic knowledge of the system, and present an appropriate policy representation for the task. It is shown that this RL method achieves remarkably faster learning than that by the existing genetic algorithm-based method.
KeywordsFree-falling cat Nonholonomic system Policy gradient method
Unable to display preview. Download preview PDF.
- 1.Nakamura, Y.: Nonholonomic robot systems, Part 1: what’s a nonholonomic robot? Journal of RSJ 11, 521–528 (1993)Google Scholar
- 3.Mita, T.: Introduction to nonlinear control Theory-Skill control of underactuated robots. SHOKODO Co., Ltd. (2000) (in Japanese)Google Scholar
- 5.Holamoto, S., Funasako, T.: Feedback control of a planar space robot using a moving manifold. Journal of RSJ 25, 745–751 (1993)Google Scholar
- 8.Tsuchiya, C., et al.: SLIP: A sophisticated learner for instance-based policy using hybrid GA. Transactions of SICE 42, 1344–1352 (2006)Google Scholar