Abstract
In this chapter, the receding-horizon near-optimal tracking control problem about a class of continuous-time nonlinear systems with fully unknown dynamics are considered. The main challenges of this problem lie in two aspects: (1) Most existing systems only restrict their considerations to the state feedback part while the input channel parameters are assumed to be known. This chapter considers fully unknown system dynamics in both the state feedback channel and the input channel. (2) The optimal control of nonlinear systems requires the solution of nonlinear Hamilton–Jacobi–Bellman equations. Up to today, there are no systematic approaches in the existing literature to solve it accurately. A novel model-free adaptive near-optimal control method is presented to solve this problem by utilizing the Taylor expansion based problem relaxation, the universal approximation property of sigmoid neural networks, and the concept of sliding mode control. By making an approximation for the performance index, it is first relaxed to a quadratic program, and then a linear algebraic equation with unknown terms. An auxiliary system is designed to reconstruct the input-to-output property of the control systems with unknown dynamics, so as to tackle the difficulty caused by the unknown terms. Then, by considering the property of the sliding mode surface, an explicit adaptive near-optimal control law is derived from the linear algebraic equation. Theoretical analysis shows that the auxiliary system is convergent, the resultant closed-loop system is asymptotically stable, and the performance index asymptomatically converges to optimal. An illustrative example and experimental results are presented, which substantiate the efficacy of the presented method and verify the theoretical results.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Liu, Y.J., Tong, S.: Barrier Lyapunov functions-based adaptive control for a class of nonlinear pure-feedback systems with full state constraints. Automatica 64, 70–75 (2016)
Wang, H., Shi, P., Li, H., Zhou, Q.: Adaptive neural tracking control for a class of nonlinear systems with dynamic uncertainties. IEEE Trans. Cybern. 47(10), 3075–3087 (2017)
Zhou, Q., Shi, P., Tian, Y., Wang, M.: Approximation-based adaptive tracking control for MIMO nonlinear systems with input saturation. IEEE Trans. Cybern. 45(10), 2119–2128 (2015)
Zhou, Q., Li, H., Wu, C., Wang, L., Ahn, C.K.: Adaptive fuzzy control of nonlinear systems with unmodeled dynamics and input saturation using small-gain approach 47(8), 1979–1989 (2017)
Li, S., Chen, S., Liu, B., Li, Y., Liang, Y.: Decentralized kinematic control of a class of collaborative redundant manipulators via recurrent neural networks. Neurocomputing 91, 1–10 (2012)
Li, S., Cui, H., Li, Y., Liu, B., Lou, Y.: Decentralized control of collaborative redundant manipulators with partial command coverage via locally connected recurrent neural networks. Neural Comput. Appl. 23(3), 1051–1060 (2013)
Jin, L., Zhang, Y., Li, S., Zhang, Y.: Modified ZNN for time-varying quadratic programming with inherent tolerance to noises and its application to kinematic redundancy resolution of robot manipulators. IEEE Trans. Ind. Electron. 63(11), 6978–6988 (2016)
Li, S., He, J., Li, Y., Rafique, M.U.: Distributed recurrent neural networks for cooperative control of manipulators: a game-theoretic perspective. IEEE Trans. Neural Netw. Learn. Syst. 28(2), 415–426 (2017)
Jin, L., Li, S., La, H.M., Luo, X.: Manipulability optimization of redundant manipulators using dynamic neural networks. IEEE Trans. Ind. Electron. 64(6), 4710–4720 (2017)
Li, Y., Li, S., Hannaford, B.: A novel recurrent neural network for improving redundant manipulator motion planning completeness. ICRA 2956–2961 (2018)
Zhang, Y., Li, S.: A neural controller for image-based visual servoing of manipulators with physical constraints. IEEE Trans. Neural Netw. Learn. Syst. 29(11), 5419–5429 (2018)
Li, S., Zhou, M., Luo, X.: Modified primal-dual neural networks for motion control of redundant manipulators with dynamic rejection of harmonic noises. IEEE Trans. Neural Netw. Learn. Syst. 29(10), 4791–4801 (2018)
Li, S., Wang, H., Rafique, M.U.: A novel recurrent neural network for manipulator control with improved noise tolerance. IEEE Trans. Neural Netw. Learn. Syst. 29(5), 1908–1918 (2018)
Jin, L., Li, S., Luo, X., Li, Y., Qin, B.: Neural dynamics for cooperative control of redundant robot manipulators. IEEE Trans. Ind. Inform. 14(9), 3812–3821 (2018)
Li, J., Zhang, Y., Li, S., Mao, M.: New discretization-formula-based zeroing dynamics for real-time tracking control of serial and parallel manipulators. IEEE Trans. Ind. Inform. 14(8), 3416–3425 (2018)
Chen, D., Zhang, Y., Li, S.: Tracking control of robot manipulators with unknown models: a jacobian-matrix-adaption method. IEEE Trans. Ind. Inform. 14(7), 3044–3053 (2018)
Zhang, Y., Li, S., Gui, J., Luo, X.: Velocity-level control with compliance to acceleration-level constraints: a novel scheme for manipulator redundancy resolution. IEEE Trans. Ind. Inform. 14(3), 921–930 (2018)
Xiao, L., Liao, B., Li, S., Zhang, Z., Ding, L., Jin, L.: Design and analysis of FTZNN applied to the real-time solution of a nonstationary Lyapunov equation and tracking control of a wheeled mobile manipulator. IEEE Trans. Ind. Inform. 14(1), 98–105 (2018)
Zhang, Y., Chen, S., Li, S., Zhang, Z.: Adaptive projection neural network for kinematic control of redundant manipulators with unknown physical parameters. IEEE Trans. Ind. Electron. 65(6), 4909–4920 (2018)
Zhang, Z., Lin, Y., Li, S., Li, Y., Yu, Z., Luo, Y.: Tricriteria optimization-coordination motion of dual-redundant-robot manipulators for complex path planning. IEEE Trans. Contr. Sys. Techn. 26(4), 1345–1357 (2018)
Jin, L., Li, S., Hu, B., Yi, C.: Dynamic neural networks aided distributed cooperative control of manipulators capable of different performance indices. Neurocomputing 291, 50–58 (2018)
Jin, L., Li, S., Yu, J., He, J.: Robot manipulator control using neural networks: a survey. Neurocomputing 285, 23–34 (2018)
Chen, D., Zhang, Y., Li, S.: Zeroing neural-dynamics approach and its robust and rapid solution for parallel robot manipulators against superposition of multiple disturbances. Neurocomputing 275, 845–858 (2018)
Li, S., Shao, Z., Guan, Y.: A dynamic neural network approach for efficient control of manipulators. IEEE Trans. Syst., Man, Cybern., Syst. 49(5), 932–941 (2019)
Zhang, Y., Li, S., Zhou, X.: Recurrent-neural-network-based velocity-level redundancy resolution for manipulators subject to a joint acceleration limit. IEEE Trans. Ind. Electron. 66(5), 3573–3582 (2019)
Zhang, Z., Chen, S., Li, S.: Compatible convex-nonconvex constrained QP-based dual neural networks for motion planning of redundant robot manipulators. IEEE Trans. Contr. Sys. Techn. 27(3), 1250–1258 (2019)
Xu, Z., Li, S., Zhou, X., Yan, W., Cheng, T., Huang, D.: Dynamic neural networks based kinematic control for redundant manipulators with model uncertainties. Neurocomputing 329, 255–266 (2019)
Li, S., Zhang, Y., Jin, L.: Kinematic control of redundant manipulators using neural networks. IEEE Trans. Neural Netw. Learn. Syst. in press
Mohammed, A.M., Li, S.: Dynamic neural networks for kinematic redundancy resolution of parallel stewart platforms. IEEE Trans. Cybern. 46(7), 1538–1550 (2016)
Jin, L., Zhang, Y.: G2-type SRMPC scheme for synchronous manipulation of two redundant robot Arms. IEEE Trans. Cybern. 45(2), 153–164 (2015)
Jin, L., Li, S., Xiao, L., Lu, R., Liao, B.: Cooperative motion generation in a distributed network of redundant robot manipulators with noises. IEEE Trans. Syst., Man, Cybern., Syst. 48(10), 1715–1724 (2018)
Jin, L., Li, S.: Distributed task allocation of multiple robots: a control perspective. IEEE Trans. Syst., Man, Cybern., Syst. 48(5), 693–701 (2018)
Jin, L., Li, S., Hu, B., Liu, M.: A survey on projection neural networks and their applications. Appl. Soft Comput. 76, 533–544 (2019)
Xiao, L., Li, K., Tan, Z., Zhang, Z., Liao, B., Chen, K., Jin, L., Li, S.: Nonlinear gradient neural network for solving system of linear equations. Inf. Process. Lett. 142, 35–40 (2019)
Xiang, Q., Liao, B., Xiao, L., Lin, L., Li, S.: Discrete-time noise-tolerant Zhang neural network for dynamic matrix pseudoinversion. Soft Comput. 23(3), 755–766 (2019)
Xiao, L., Li, S., Yang, J., Zhang, Z.: A new recurrent neural network with noise-tolerance and finite-time convergence for dynamic quadratic minimization. Neurocomputing 285, 125–132 (2018)
Xiao, L., Liao, B., Li, S., Chen, K.: Nonlinear recurrent neural networks for finite-time solution of general time-varying linear matrix equations. Neural Netw. 98, 102–113 (2018)
Xiao, L., Zhang, Z., Zhang, Z., Li, W., Li, S.: Design, verification and robotic application of a novel recurrent neural network for computing dynamic Sylvester equation. Neural Netw. 105, 185–196 (2018)
Zhang, Z., Lu, Y., Zheng, L., Li, S., Yu, Z., Li, Y.: A new varying-parameter convergent-differential neural-network for solving time-varying convex QP problem constrained by linear-equality. IEEE Trans. Automat. Contr. 63(12), 4110–4125 (2018)
Jin, L., Li, S.: Nonconvex function activated zeroing neural network models for dynamic quadratic programming subject to equality and inequality constraints. Neurocomputing 267, 107–113 (2017)
Jin, L., Li, S., Liao, B., Zhang, Z.: Zeroing neural networks: a survey. Neurocomputing 267, 597–604 (2017)
Mao, M., Li, J., Jin, L., Li, S., Zhang, Y.: Enhanced discrete-time Zhang neural network for time-variant matrix inversion in the presence of bias noises. Neurocomputing 207, 220–230 (2016)
Jin, L., Zhang, Y., Li, S.: Integration-enhanced Zhang neural network for real-time-varying matrix inversion in the presence of various kinds of noises. IEEE Trans. Neural Netw. Learn. Syst. 27(12), 2615–2627 (2016)
Li, S., Li, Y.: Nonlinearly activated neural network for solving time-varying complex sylvester equation. IEEE Trans. Cybern. 44(8), 1397–1407 (2014)
Li, S., Li, Y., Wang, Z.: A class of finite-time dual neural networks for solving quadratic programming problems and its k-winners-take-all application. Neural Netw. 39, 27–39 (2013)
Na, J., Ren, X., Zheng, D.: Adaptive control for nonlinear pure-feedback systems with high-order sliding mode observer. IEEE Trans. Neural Netw. Learn. Syst. 24(3), 370–382 (2013)
Kayacan, E., Kayacan, E., Ramon, H., Saeys, W.: Adaptive neuro-fuzzy control of a spherical rolling robot using sliding-mode-control-theory-based online learning algorithm. IEEE Trans. Cybern. 43(1), 170–179 (2013)
Shen, Q., Shi, P., Shi, Y.: Distributed adaptive fuzzy control for nonlinear multiagent systems via sliding mode observers. IEEE Trans. Cybern. 46(12), 3086–3097 (2016)
Liu, Y.J., Tong, S.C., Wang, D., Li, T.S., Chen, C.L.P.: Adaptive neural output feedback controller design with reduced-order observer for a class of uncertain nonlinear SISO systems. IEEE Trans. Neural Netw. 22(8), 1328–1334 (2011)
Dai, S.L., Wang, M., Wang, C.: Neural learning control of marine surface vessels with guaranteed transient tracking performance. IEEE Trans. Ind. Electron. 63(3), 1717–1727 (2016)
Kostarigka, A.K., Rovithakis, G.A.: Adaptive dynamic output feedback neural network control of uncertain MIMO nonlinear systems with prescribed performance. IEEE Trans. Neural Netw. Learn. Syst. 23(1), 138–149 (2012)
El-Ferik, S., Qureshi, A., Lewis, F.L.: Neuro-adaptive cooperative tracking control of unknown higher-order affine nonlinear systems. Automatica 50(3), 798–808 (2014)
Zhou, Q., Li, H., Shi, P.: Decentralized adaptive fuzzy tracking control for robot finger dynamics. IEEE Trans. Fuzzy Syst. 23(3), 501–510 (2015)
Yang, C., Wang, X., Cheng, L., Ma, H.: Neural-learning-based telerobot control with guaranteed performance. IEEE Trans. Cybern. 47(10), 3148–3159 (2017)
He, W., Yan, Z., Sun, C., Chen, Y.: Adaptive neural network control of a flapping wing micro aerial vehicle with disturbance observer. IEEE Trans. Cybern. 47(10), 3452–3465 (2017)
Cheng, L., Liu, W., Yang, C., Huang, T., Hou, Z., Tan, M.: A neural-network-based controller for piezoelectric-actuated stick-lip devices. IEEE Trans. Ind. Electron. 65(3), 2598–2607 (2018)
Chen, W.H., Ballance, D.J., Gawthrop, P.J.: Optimal control of nonlinear systems: a predictive control approach. Automatica 39(4), 633–641 (2003)
Chen, W.H., Balance, D.J., Gawthrop, P.J., Gribble, J.J., Reill, J.O.: Nonlinear PID predictive controller. Control Theory Appl. 146(6), 603–611 (1999)
Liu, C., Chen, W.H., Andrews, J.: Tracking control of small-scale helicopters using explicit nonlinear MPC augmented with disturbance observers. Control Eng. Pract. 20(3), 258–268 (2012)
Errouissi, R., Al-Durra, A., Muyeen, S.M.: Design and implementation of a nonlinear PI predictive controller for a grid-tied photovoltaic inverter. IEEE Trans. Ind. Electron. 64(2), 1241–1250 (2017)
Errouissi, R., Ouhrouche, M., Chen, W.H., Trzynadlowski, A.M.: Robust nonlinear predictive controller for permanent-magnet synchronous motors with an optimized cost function. IEEE Trans. Ind. Electron. 59(7), 2849–2858 (2011)
Liu, Y.J., Tong, S.: Optimal control-based adaptive NN design for a class of nonlinear discrete-time block-triangular systems. IEEE Trans. Cybern. 46(11), 2670–2680 (2016)
Liu, Y.J., Gao, Y., Tong, S., Li, Y.: Fuzzy approximation-based adaptive backstepping optimal control for a class of nonlinear discrete-time systems with dead-zone. IEEE Trans. Fuzzy Syst. 24(1), 16–28 (2016)
Sheu, J.W., Lin, W.S.: Adaptive optimal control for designing automatic train regulation for metro line. IEEE Trans. Control Syst. Technol. 20(5), 1319–1327 (2012)
Heydari, A., Balakrishnan, S.N.: Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics. IEEE Trans. Neural Netw. Learn. Syst. 24(1), 145–157 (2013)
Sun, K., Li, Y., Tong, S.: Fuzzy adaptive output feedback optimal control design for strict-feedback nonlinear systems. IEEE Trans. Syst. Man Cybern. Syst. 47(1), 33–44 (2017)
Huang, Y.: Neuro-observer based online finite-horizon optimal control for uncertain non-linear continuous-time systems. IET Control Theory Appl. 11(3), 401–410 (2017)
Dutta, S., Patchaikani, P.K., Behera, L.: Near-optimal controller for nonlinear continuous-time systems with unknown dynamics using policy iteration. IEEE Trans. Neural Netw. Learn. Syst. 27(7), 1537–1549 (2015)
Wang, Y., Miao, Z., Zhong, H., Pan, Q.: Simultaneous stabilization and tracking of nonholonomic mobile robots: a Lyapunov-based approach. IEEE Trans. Control Syst. Technol. 23(4), 1440–1450 (2015)
Zhang, Y., Yu, X., Yin, Y., Peng, C., Fan, Z.: Singularity-conquering ZG controllers of z2g1 type for tracking control of the IPC system. Int. J. Control 87(9), 1729–1746 (2014)
Baek, J., Jin, M., Han, S.: A new adaptive sliding-mode control scheme for application to robot manipulators. IEEE Trans. Ind. Electron. 63(5), 3632–3637 (2016)
Zhang, Y., Chen, D., Jin, L., Zhang, Y., Yin, Y.: GD-aided IOL (input-output linearisation) controller for handling affine-form nonlinear system with loose condition on relative degree. Int. J. Control 89(4), 757–769 (2016)
Zhang, Y., Tao, G., Chen, M.: Adaptive neural network based control of noncanonical nonlinear systems. IEEE Trans. Neural Netw. Learn. Syst. 27(9), 1864–1877 (2016)
Davila, J.: Exact tracking using backstepping control design and high-order sliding modes. IEEE Trans. Autom. Control 58(8), 2077–2081 (2013)
Castaneda, H., Plestan, F., Chriette, A., León-Morales, J.D.: Continuous differentiator based on adaptive second-order sliding-mode control for a 3-DOF helicopter. IEEE Trans. Ind. Electron. 63(9), 5786–5793 (2016)
Isidori, A.: Nonlinear Control Systems: An Introduction, 3rd edn. Springer, New York (1995)
Gu, D., Hu, H.: Receding horizon tracking control of wheeled mobile robots. IEEE Trans. Control Syst. Technol. 14(4), 743–749 (2016)
Cybenko, G.: Approximations by superpositions of a sigmoidal function. Math. Control Signals Syst. 2(4), 303–314 (1989)
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70, 489–501 (2006)
Zhang, Z., Zhang, Y.: Acceleration-level cyclic-motion generation of constrained redundant robots tracking different paths. IEEE Trans. Syst., Man, Cybern. B, Cybern. 42(4), 1257–1269 (2012)
Hassine, I.M.B., Naouar, M.W., Mrabet-Bellaaj, N.: Model predictive-sliding mode control for three-phase grid-connected converters. IEEE Trans. Ind. Electron. 64(2), 1341–1349 (2017)
Chen, C.T.: Linear System Theory And Design. Oxford University Press, New York (1995)
Jin, L., Zhang, Y., Li, S., Zhang, Y.: Noise-tolerant ZNN models for solving time-varying zero-finding problems: a control-theoretic approach. IEEE Trans. Autom. Control 62(2), 992–997 (2017)
Joya, G., Atencia, M.A., Sandoval, F.: Hopfield neural networks for optimization: study of the different dynamics. Neurocomputing 43, 219–237 (2002)
Cichocki, A., Unbehauen, R.: Robust neural networks with on-line learning for blind identification and blind sepration of sources. IEEE Trans. Circuits Syst. I, Fundam. Theory Appl. 43(11), 894–906 (1996)
Levant, A.: Robust exact differentiation via sliding mode technique. Automatica 34(3), 379–384 (1998)
Guo, B.Z., Zhao, Z.L.: On convergence of tracking differentiator. Int. J. Control 84(4), 693–701 (2011)
Tian, D., Shen, H., Dai, M.: Improving the rapidity of nonlinear tracking differentiator via feedforward. IEEE Trans. Ind. Electron. 61(7), 3736–3743 (2014)
He, W., He, X., Ge, S.S.: Vibration control of flexible marine riser systems with input saturation. IEEE/ASME Trans. Mechatron. 21(1), 254–265 (2016)
Yu, J., Zhao, L., Yu, H., Lin, C., Dong, W.: Fuzzy finite-time command filtered control of nonlinear systems with input saturation. IEEE Trans. Cybern. 48(8), 2378–2387 (2018)
Jiang, B., Hu, Q., Friswell, M.I.: Fixed-time attitude control for rigid spacecraft with actuator saturation and faults. IEEE Trans. Control Syst. Technol. 24(5), 1892–1898 (2016)
Boyd, S., Sastry, S.S.: Necessary and sufficient conditions for parameter convergence in adaptive control. Automatica 22(6), 629–639 (1986)
Chen, W., Wen, C., Hua, S., Sun, C.: Distributed cooperative adaptive identification and control for a group of continuous-time systems with a cooperative PE condition via consensus. IEEE Trans. Autom. Control 59(1), 91–106 (2014)
Dixon, W.E., Dawson, D.M., Zhang, F., Zergeroglu, E.: Global exponential tracking control of a mobile robot system via a PE condition. IEEE Trans. Syst., Man, Cybern. B, Cybern. 30(1), 129–142 (2000)
Münz, U., Papachristodoulou, A., Allgöwer, F.: Robust consensus controller design for nonlinear relative degree two multi-agent systems with communication constraints. IEEE Trans. Autom. Control 56(1), 145–151 (2011)
Loría, A.: Observers are unnecessary for output-feedback control of lagrangian systems. IEEE Trans. Autom. Control 61(4), 905–920 (2016)
Chen, W., Ballance, D.J., Gawthrop, P.J., O’Reilly, J.: A nonlinear disturbance observer for robotic manipulators. IEEE Trans. Ind. Electron. 47(4), 932–938 (2000)
Chen, W.: Disturbance observer based control for nonlinear systems. IEEE/ASME Trans. Mechatron. 9(4), 706–710 (2004)
Yang, J., Li, S., Yu, X.: Sliding-mode control for systems with mismatched uncertainties via a disturbance observer. IEEE Trans. Ind. Electron. 60(1), 160–169 (2013)
Khalil, H.: Nonlinear Systems. Prentice Hall, New Jersey (1996)
Nagle, R.K., Saff, E., Snider, A.: Fundamentals of Differential Equations, 7th edn. Pearson Addison Wesley, Boston (2008)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Zhang, Y., Li, S., Zhou, X. (2020). Model-Free Adaptive Near-Optimal Tracking Control. In: Deep Reinforcement Learning with Guaranteed Performance. Studies in Systems, Decision and Control, vol 265. Springer, Cham. https://doi.org/10.1007/978-3-030-33384-3_5
Download citation
DOI: https://doi.org/10.1007/978-3-030-33384-3_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33383-6
Online ISBN: 978-3-030-33384-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)