Temporal Difference (TD) Based Critic-Actor Adaptive Control for a Fine Hand Motion Rehabilitation Robot
Robot assisted post-stroke rehabilitation training is an effective approach in delivering the highly intensive repetitive training, aiming to retrain the neural pathways in the brain thus to restore and improve the affected mobility skills. The adaptive control of robotic devices, especially assist-as-needed control providing exact assistive force intensity along the intended motion trajectory for fine motion, can be a complex but effective method. A temporal difference based critic-actor reinforcement learning control method is explored in this study. The effectiveness of the method is verified through Matlab simulation and implemented on a hand rehabilitation robotic device. Results suggest that the control system can fulfil the control task with high performance and reliability, thus holding the promise of improving the fine hand motion rehabilitation training efficiency.
KeywordsStroke recovery Rehabilitation robot Adaptive control Reinforcement learning Neural network
- 3.Kaelbling, L.P., M.L. Littman, and A.W. Moore. 1996. Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4: 237–285.Google Scholar
- 5.Papahristou, N., and I. Refanidis. 2011. Training neural networks to play backgammon variants using reinforcement learning. In Applications of evolutionary computation, vol. 6624, 113–122, eds. Di Chio C., S. Cagnoni, C. Cotta, M. Ebner, A. Ekárt, and A. Esparcia-Alcázar, et al. Heidelberg: Springer.Google Scholar