Average Optimality for Finite Models

Guo, Xianping; Hernández-Lerma, Onésimo

doi:10.1007/978-3-642-02547-1_3

Chapter
First Online: 01 January 2009

Part of the book series: Stochastic Modelling and Applied Probability (SMAP, volume 62)

Abstract

Chapter 3 deals with finite models, that is, continuous-time MDPs with a finite number of states and actions. The long-run expected average reward (AR) criterion and the n-bias (n=0,1,…) optimality criteria are introduced in Sect. 3.2. (Occasionally, we abbreviate expected average reward as EAR rather than expected AR.) For every n=0,1,…, formulas expressing the difference between the n-biases for any two policies are provided in Sect. 3.3. These formulas are used in Sect. 3.4 to characterize n-bias optimal policies. The policy iteration and the linear programming algorithms for computing optimal policies for each of the n-bias criteria are given in Sects. 3.5 and 3.6, respectively.
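As a rough illustration of the kind of computation the abstract refers to, the following is a minimal sketch of policy iteration for the average-reward criterion in a finite continuous-time MDP. It is not the chapter's algorithm: it assumes every stationary policy is unichain, handles only the average reward (not the higher n-bias criteria), and normalizes the relative values by setting h[0] = 0. The function names, the data layout (`rates[i][a][j]` for transition rates, `rewards[i][a]` for reward rates), and the two-state example are all hypothetical.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda k: abs(m[k][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system (is the policy unichain?)")
        m[col], m[pivot] = m[pivot], m[col]
        for k in range(col + 1, n):
            factor = m[k][col] / m[col][col]
            for c in range(col, n + 1):
                m[k][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def evaluate(policy, rates, rewards):
    """Solve g = r(i, f(i)) + sum_j q(j | i, f(i)) h(j) with h[0] = 0.

    Returns the gain g and the relative values h (assumed normalization).
    """
    n = len(policy)
    a, b = [], []
    for i in range(n):
        q = rates[i][policy[i]]
        # Unknown vector is (g, h[1], ..., h[n-1]); h[0] is fixed at 0.
        a.append([1.0] + [-q[j] for j in range(1, n)])
        b.append(rewards[i][policy[i]])
    x = solve_linear(a, b)
    return x[0], [0.0] + x[1:]


def policy_iteration(rates, rewards, tol=1e-9):
    """Alternate evaluation and improvement until no action changes."""
    n = len(rates)
    policy = [0] * n
    while True:
        gain, h = evaluate(policy, rates, rewards)
        changed = False
        for i in range(n):
            def score(a):
                return rewards[i][a] + sum(rates[i][a][j] * h[j] for j in range(n))
            best = max(range(len(rates[i])), key=score)
            # Switch only on a strict improvement, to avoid cycling on ties.
            if score(best) > score(policy[i]) + tol:
                policy[i] = best
                changed = True
        if not changed:
            return policy, gain, h


# Hypothetical two-state model: state 0 has two actions, state 1 has one.
# Each rate row sums to zero, as required of a transition rate matrix.
rates = [
    [[-1.0, 1.0], [-2.0, 2.0]],
    [[3.0, -3.0]],
]
rewards = [
    [1.0, 0.5],
    [4.0],
]

policy, gain, h = policy_iteration(rates, rewards)
print(policy, round(gain, 6))
```

In this toy model, leaving state 0 at the faster rate earns less while in state 0 but reaches the high-reward state 1 sooner, so the iteration switches from action 0 (gain 1.75) to action 1 (gain about 1.9) and then stops.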

Author information

Authors and Affiliations

School of Mathematics and Computational Science, Zhongshan University, Guangzhou, 510275, People’s Republic of China
Xianping Guo
Departamento de Matemáticas, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), Apdo Postal 14-740, México, D.F., 07000, Mexico
Onésimo Hernández-Lerma

Corresponding authors

Correspondence to Xianping Guo or Onésimo Hernández-Lerma.

About this chapter

Cite this chapter

Guo, X., Hernández-Lerma, O. (2009). Average Optimality for Finite Models. In: Continuous-Time Markov Decision Processes. Stochastic Modelling and Applied Probability, vol 62. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02547-1_3

DOI: https://doi.org/10.1007/978-3-642-02547-1_3
Published: 18 September 2009
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02546-4
Online ISBN: 978-3-642-02547-1
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)