Optimal Tracking Control Scheme for Discrete-Time Nonlinear Systems with Approximation Errors

Wei, Qinglai; Liu, Derong

doi:10.1007/978-3-642-39068-5_1

Qinglai Wei¹⁹ &
Derong Liu¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7952))

Included in the following conference series:

International Symposium on Neural Networks

3788 Accesses
7 Citations

Abstract

In this paper, we aim to solve an infinite-time optimal tracking control problem for a class of discrete-time nonlinear systems using iterative adaptive dynamic programming (ADP) algorithm. When the iterative tracking control law and the iterative performance index function in each iteration cannot be accurately obtained, a new convergence analysis method is developed to obtain the convergence conditions of the iterative ADP algorithm according to the properties of the finite approximation errors. If the convergence conditions are satisfied, it is shown that the iterative performance index functions converge to a finite neighborhood of the greatest lower bound of all performance index functions under some mild assumptions. Neural networks are used to approximate the performance index function and compute the optimal tracking control policy, respectively, for facilitating the implementation of the iterative ADP algorithm. Finally, a simulation example is given to illustrate the performance of the present method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abu-Khalaf, M., Lewis, F.L.: Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 41(5), 779–791 (2005)
Article MathSciNet MATH Google Scholar
Abu-Khalaf, M., Lewis, F.L., Huang, J.: Neurodynamic programming and zero-sum games for constrained control systems. IEEE Transactions on Neural Networks 19(7), 1243–1252 (2008)
Article Google Scholar
Al-Tamimi, A., Abu-Khalaf, M., Lewis, F.L.: Adaptive critic designs for discrete-time zero-sum games with application to H _∞ control. IEEE Trans. Systems, Man, and Cybernetics-Part B: Cybernetics 37(7), 240–247 (2007)
Article Google Scholar
Prokhorov, D.V., Wunsch, D.C.: Adaptive critic designs. IEEE Transactions on Neural Networks 8(5), 997–1007 (1997)
Article Google Scholar
Hao, X., Jagannathan, S.: Model-free H _∞ stochastic optimal design for unknown linear networked control system zero-sum games via Q-learning. In: 2011 IEEE International Symposium on Intelligent Control (ISIC), Singapore, pp. 198–203 (2011)
Google Scholar
Liu, D., Zhang, Y., Zhang, H.: A self-learning call admission control scheme for CDMA cellular networks. IEEE Transactions on Neural Networks 16(5), 1219–1228 (2005)
Article Google Scholar
Tan, F., Liu, D., Guan, X., Xing, S.: Trajectory tracking control of nonholonomic mobile robot system based on unfalsified control theory. Control and Decision 25(6), 1693–1697 (2010)
Google Scholar
Wang, F., Jin, N., Liu, D., Wei, Q.: Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound. IEEE Transactions on Neural Networks 22(1), 24–36 (2011)
Article Google Scholar
Wang, D., Liu, D., Wei, Q.: Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach. Neurocomputing 78(1), 14–22 (2012)
Article Google Scholar
Wei, Q., Liu, D.: Nonlinear multi-person zero-sum differential games using iterative adaptive dynamic programming. In: 30th Chinese Control Conference (CCC), Yantai, China, pp. 2456–2461 (2011)
Google Scholar
Wei, Q., Liu, D.: An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state. Neural Networks 32, 236–244 (2012)
Article MATH Google Scholar
Wei, Q., Zhang, H., Dai, J.: Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions. Neurocomputing 72(7-9), 1839–1848 (2009)
Article Google Scholar
Werbos, P.J.: Advanced forecasting methods for global crisis warning and models of intelligence. General Systems Yearbook 22, 25–38 (1977)
Google Scholar
Werbos, P.J.: A menu of designs for reinforcement learning over time. In: Miller, W.T., Sutton, R.S., Werbos, P.J. (eds.) Neural Networks for Control, pp. 67–95. MIT Press, Cambridge (1991)
Google Scholar
Zhang, H., Wei, Q., Luo, Y.: A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm. IEEE Transactions on System, Man, and cybernetics-Part B: Cybernetics 38(4), 937–942 (2008)
Article Google Scholar
Zhang, H., Song, R., Wei, Q.: Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming. IEEE Transactions on Neural Networks 22(12), 1851–1862 (2011)
Article Google Scholar
Zhang, H., Wei, Q., Liu, D.: An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica 47(1), 207–214 (2011)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
Qinglai Wei & Derong Liu

Authors

Qinglai Wei
View author publications
You can also search for this author in PubMed Google Scholar
Derong Liu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information and Communication Engineering, Dalian University of Technology, A 530, Chuangxinyuan Building, 116023, Dalian, China
Chengan Guo
Institute of Automation, Chinese Academy of Sciences, 100864, Beijing, China
Zeng-Guang Hou
School of Automation, Huazhong University of Science and Technology, 430074, Wuhan, China
Zhigang Zeng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wei, Q., Liu, D. (2013). Optimal Tracking Control Scheme for Discrete-Time Nonlinear Systems with Approximation Errors. In: Guo, C., Hou, ZG., Zeng, Z. (eds) Advances in Neural Networks – ISNN 2013. ISNN 2013. Lecture Notes in Computer Science, vol 7952. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39068-5_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-39068-5_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39067-8
Online ISBN: 978-3-642-39068-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics