Model-free Optimal Tracking Control for an Aircraft Skin Inspection Robot with Constrained-input and Input Time-delay via Integral Reinforcement Learning
- 4 Downloads
This paper presents a model-free optimal tracking control algorithm for an aircraft skin inspection robot with constrained-input and input time-delay. To tackle the input time-delay problem, the original system is transformed into a delay-free system with constrained-input and unknown input coupling term. In order to overcome the optimal control problem subject to constrained-input, a discounted value function is employed. In general, it is known that the HJB equation does not admit a classical smooth solution. Moreover, since the input coupling term of the delay-free system is unknown, a model-free integral reinforcement learning(IRL) algorithm which only requires the system sampling data generated by arbitrary different control inputs and external disturbances is proposed. The model-free IRL method is implemented on an actor-critic neural network (NN) structure. A system sampling data set is utilized to learn the value function and control policy. Finally, the simulation verifies the effectiveness of the proposed algorithm.
KeywordsConstrained-input input time-delay model-free reinforcement learning
Unable to display preview. Download preview PDF.
- Z. Liu, P. Smith, T. Park, A. A. Trindade, and Q. Hui, “Automated contaminant source localization in spatiotemporal fields: a response surface and experimental design approach,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 47, no. 3, pp. 569–583, February 2017.CrossRefGoogle Scholar
- J. Sun and C. Liu, “Distributed fuzzy adaptive back-stepping optimal control for nonlinear multimissile guidance systems with input saturation,” IEEE Transactions on Fuzzy Systems, vol. 27, no. 3, pp. 447–461, July 2019.Google Scholar
- V. Mien, M. Mavrovouniotis, and S. Ge, “An adaptive backstepping nonsingular fast terminal sliding mode control for robust fault tolerant control of robot manipulators,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 99, pp. 1–11, January 2018.Google Scholar
- Y. Li, K. Sun, and S. Tong, “Observer-based adaptive fuzzy fault-tolerant optimal control for SISO nonlinear systems,” IEEE Transactions on Cybernetics, vol. 99, pp. 1–13, Feburary 2018.Google Scholar
- H. Wu, S. Song, K. You, and C. Wu, “Depth control of model-free AUVs via reinforcement learning,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 99, pp. 1–12, January 2018.Google Scholar
- J. Hou, D. Wang, D. Liu, and Y. Zhang, “Model-free H ∞ optimal tracking control of constrained nonlinear systems via an iterative adaptive learning algorithm,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 99, pp. 1–12, August 2018.Google Scholar
- S. Li, L. Ding, H. Gao, Y. J. Liu, N. Li, and Z. Deng, “Reinforcement learning neural network-based adaptive control for state and input time-delayed wheeled mobile robots,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2018. DOI: https://doi.org/10.1109/TSMC.2018.2870724