An Iterative ADP Method to Solve for a Class of Nonlinear Zero-Sum Differential Games

Song, Ruizhuo; Wei, Qinglai; Li, Qing

doi:10.1007/978-981-13-1712-5_10

Chapter
First Online: 29 December 2018

Part of the book series: Studies in Systems, Decision and Control (SSDC, volume 166)

Abstract

In this chapter, an iterative adaptive dynamic programming (ADP) method is presented to solve a class of continuous-time nonlinear two-person zero-sum differential games. The idea is to use the ADP technique to obtain, iteratively, the optimal control pair that makes the performance index function reach the saddle point of the zero-sum differential game. When the saddle point does not exist, a mixed optimal control pair is obtained so that the performance index function reaches the mixed optimum. Rigorous proofs are given to guarantee that the control pair stabilizes the nonlinear system, and the convergence of the performance index function is also proved. To facilitate implementation of the iterative ADP method, neural networks are used to approximate the performance index function, compute the optimal control policy, and model the nonlinear system, respectively. Two examples are given to demonstrate the validity of the proposed method.
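
To make the idea of iterating toward a saddle-point control pair concrete, the following is a minimal sketch for a scalar linear-quadratic game, which is far simpler than the nonlinear setting of the chapter. It is not the chapter's algorithm and uses no neural networks. The dynamics dx/dt = a*x + b*u + d*w, the cost integrand q*x^2 + r*u^2 - gamma^2*w^2, the quadratic value function V(x) = p*x^2, the function name, and all numerical values are assumptions chosen only for illustration.

```python
import math


def solve_scalar_zero_sum_game(a, b, d, q, r, gamma, p0, tol=1e-10, max_iter=100):
    """Simultaneous policy iteration for an assumed scalar zero-sum game.

    Dynamics: dx/dt = a*x + b*u + d*w   (u minimizes, w maximizes)
    Cost:     integral of (q*x^2 + r*u^2 - gamma^2*w^2) dt
    Value function is assumed to have the form V(x) = p*x^2.
    """
    p = p0
    for _ in range(max_iter):
        # Policy improvement: stationary points of the Hamiltonian in u and w.
        k = b * p / r             # control gain, u = -k*x
        l = d * p / gamma ** 2    # disturbance gain, w = l*x
        a_cl = a - b * k + d * l  # closed-loop dynamics coefficient
        if a_cl >= 0:
            raise ValueError("closed loop is not stable; try a larger p0")
        # Policy evaluation: solve 2*p*a_cl + q + r*k^2 - gamma^2*l^2 = 0 for p.
        p_next = -(q + r * k ** 2 - gamma ** 2 * l ** 2) / (2 * a_cl)
        if abs(p_next - p) < tol:
            return p_next, b * p_next / r, d * p_next / gamma ** 2
        p = p_next
    raise RuntimeError("iteration did not converge")


# Hypothetical parameters: an open-loop unstable plant with gamma = 2.
a, b, d, q, r, gamma = 1.0, 1.0, 1.0, 1.0, 1.0, 2.0
p, k, l = solve_scalar_zero_sum_game(a, b, d, q, r, gamma, p0=5.0)

# Residual of the game Riccati equation 2*a*p + q - (b^2/r - d^2/gamma^2)*p^2 = 0.
residual = 2 * a * p + q - (b ** 2 / r - d ** 2 / gamma ** 2) * p ** 2
print(p, k, l, residual)
```

In this scalar case each sweep coincides with a Newton step on the game Riccati equation, so convergence depends on the initial guess p0 yielding a stable closed loop. The nonlinear problem treated in the chapter has no such closed form, which is why function approximators such as neural networks are used there.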

Author information

Authors and Affiliations

University of Science and Technology Beijing, Beijing, China
Ruizhuo Song & Qing Li
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Qinglai Wei

Corresponding author

Correspondence to Ruizhuo Song.

About this chapter

Cite this chapter

Song, R., Wei, Q., Li, Q. (2019). An Iterative ADP Method to Solve for a Class of Nonlinear Zero-Sum Differential Games. In: Adaptive Dynamic Programming: Single and Multiple Controllers. Studies in Systems, Decision and Control, vol 166. Springer, Singapore. https://doi.org/10.1007/978-981-13-1712-5_10

DOI: https://doi.org/10.1007/978-981-13-1712-5_10
Published: 29 December 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1711-8
Online ISBN: 978-981-13-1712-5
eBook Packages: Intelligent Technologies and Robotics (R0)
