Zero-Sum Games for Discrete-Time Systems Based on Model-Free ADP

Zhang, Huaguang; Liu, Derong; Luo, Yanhong; Wang, Ding

doi:10.1007/978-1-4471-4757-2_8

Part of the book series: Communications and Control Engineering (CCE)

Abstract

In this chapter, zero-sum games are investigated for discrete-time systems based on the model-free adaptive dynamic programming (ADP) method. First, an effective data-based optimal control scheme is developed via the iterative ADP algorithm to find the optimal controller for a class of discrete-time zero-sum games for Roesser-type 2-D systems. Because exact models of many 2-D systems cannot be obtained, the iterative ADP method is designed to avoid the need for an exact system model. Second, a data-based optimal output feedback controller is developed for solving the zero-sum games of a class of discrete-time systems; its merit is that it requires neither knowledge of the system model nor information about the system states. Theoretical analysis and a simulation study show the validity of the methods presented.
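
To make the idea of a model-free, data-based solution to a zero-sum game concrete, the following is a minimal illustrative sketch, not the chapter's algorithm. It assumes a scalar, 1-D linear plant with full state measurement (the chapter itself treats Roesser-type 2-D systems and output feedback), a quadratic stage cost q x² + r u² − γ² w², and a Q-learning style value iteration fitted by least squares. The plant coefficients, the weights `q`, `r`, `gamma`, and the function names are all hypothetical choices made for the example; the learner only sees sampled transitions from `simulate_step`, never the coefficients.

```python
import numpy as np

def simulate_step(x, u, w):
    """Hypothetical black-box plant: the learner only observes its output."""
    return 0.9 * x + 1.0 * u + 0.5 * w

def q_learning_zero_sum(q=1.0, r=1.0, gamma=2.0, iters=60, samples=40, seed=0):
    """Fit Q(x,u,w) = z' H z, z = [x,u,w], from data by value iteration."""
    rng = np.random.default_rng(seed)
    # Exploratory data: random states, controls (u) and disturbances (w).
    x = rng.uniform(-1.0, 1.0, samples)
    u = rng.uniform(-1.0, 1.0, samples)
    w = rng.uniform(-1.0, 1.0, samples)
    x_next = simulate_step(x, u, w)

    # Quadratic basis so that theta = [Hxx, Hxu, Hxw, Huu, Huw, Hww].
    Phi = np.column_stack([x * x, 2 * x * u, 2 * x * w, u * u, 2 * u * w, w * w])
    stage = q * x * x + r * u * u - gamma**2 * w * w

    P = 0.0  # value kernel V(x) = P x^2, initialised at zero
    for _ in range(iters):
        # Target: stage cost plus current value of the observed next state.
        target = stage + P * x_next**2
        theta, *_ = np.linalg.lstsq(Phi, target, rcond=None)
        hxx, hxu, hxw, huu, huw, hww = theta
        H22 = np.array([[huu, huw], [huw, hww]])
        H21 = np.array([hxu, hxw])
        # Saddle point of the quadratic Q-function: u = L x, w = K x.
        gains = -np.linalg.solve(H22, H21)
        P = hxx + H21 @ gains
    L, K = gains
    return P, L, K

if __name__ == "__main__":
    P, L, K = q_learning_zero_sum()
    print("value kernel P:", P)
    print("control gain L:", L, "disturbance gain K:", K)
```

Because the data in this sketch are noise-free and the Q-function is exactly quadratic, the least-squares fit reproduces the model-based game Riccati recursion without ever using the plant coefficients, which is the sense in which the scheme is model-free.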

Author information

Authors and Affiliations

College of Information Science Engin., Northeastern University, Shenyang, People’s Republic of China
Huaguang Zhang & Yanhong Luo
Institute of Automation, Laboratory of Complex Systems, Chinese Academy of Sciences, Beijing, People’s Republic of China
Derong Liu & Ding Wang

About this chapter

Cite this chapter

Zhang, H., Liu, D., Luo, Y., Wang, D. (2013). Zero-Sum Games for Discrete-Time Systems Based on Model-Free ADP. In: Adaptive Dynamic Programming for Control. Communications and Control Engineering. Springer, London. https://doi.org/10.1007/978-1-4471-4757-2_8

DOI: https://doi.org/10.1007/978-1-4471-4757-2_8
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4756-5
Online ISBN: 978-1-4471-4757-2
eBook Packages: Engineering, Engineering (R0)
