The functional equations of undiscounted denumerable state Markov renewal programming

  • Elke Mann

Abstract

In this paper we establish conditions for the existence of a finite solution of the functional equations
$$ g(i) \;=\; \max_{a \in A(i)} \;\sum_{j \in I} p(i,a,j)\, g(j), \qquad i \in I, $$
(1)
$$ v(i) \;=\; \max_{a \in A_0(i)} \left[\, r(i,a) - t(i,a)\, g(i) + \sum_{j \in I} p(i,a,j)\, v(j) \right], \qquad i \in I, $$
(2)
where $I$ is a denumerable set, $A(i)$ is a compact metric space for every $i \in I$, $0 < t(i,a) < \infty$, $-\infty < r(i,a) < \infty$, $p(i,a,j) \ge 0$ for all $i, j \in I$, $a \in A(i)$, and $\sum_{j \in I} p(i,a,j) = 1$ for all $i \in I$, $a \in A(i)$.
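In the usual reading, g(i) is the maximal gain rate and v(i) a relative value (bias) function, with A_0(i) taken as the set of actions attaining the maximum in (1). As a purely illustrative aid, not part of the paper, the following Python sketch checks whether a candidate pair (g, v) satisfies (1) and (2) on a toy model with two states and finitely many actions; the model data and the helper name `check_functional_equations` are hypothetical, and the paper itself treats denumerable I and compact metric A(i).

```python
# A minimal sketch (assumed toy data, not the paper's construction): check
# whether a candidate pair (g, v) satisfies equations (1) and (2) on a model
# with finitely many states and actions.

states = [0, 1]                      # finite stand-in for the state set I
actions = {0: ["a", "b"], 1: ["a"]}  # A(i): admissible actions in state i

# p[(i, a)][j]: transition probability to j, r[(i, a)]: expected one-step
# reward, t[(i, a)]: expected holding time (0 < t(i,a) < infinity).
p = {(0, "a"): [1.0, 0.0], (0, "b"): [0.5, 0.5], (1, "a"): [0.0, 1.0]}
r = {(0, "a"): 1.0, (0, "b"): 2.0, (1, "a"): 0.0}
t = {(0, "a"): 1.0, (0, "b"): 2.0, (1, "a"): 1.0}


def check_functional_equations(g, v, tol=1e-9):
    """Return True iff the candidate (g, v) satisfies (1) and (2) up to tol."""
    for i in states:
        # Equation (1): g(i) = max_{a in A(i)} sum_j p(i,a,j) g(j).
        rhs1 = {a: sum(p[(i, a)][j] * g[j] for j in states) for a in actions[i]}
        if abs(g[i] - max(rhs1.values())) > tol:
            return False
        # A_0(i) is read here as the set of maximizers in (1) (the usual
        # convention; the paper gives the precise definition).
        A0 = [a for a in actions[i] if abs(rhs1[a] - g[i]) <= tol]
        # Equation (2): v(i) = max_{a in A_0(i)} [r(i,a) - t(i,a)g(i) + sum_j p v(j)].
        rhs2 = max(
            r[(i, a)] - t[(i, a)] * g[i] + sum(p[(i, a)][j] * v[j] for j in states)
            for a in A0
        )
        if abs(v[i] - rhs2) > tol:
            return False
    return True


# State 0 under action "a" earns reward 1 per unit time, while state 1 is an
# absorbing zero-reward state, so the gain rates are g = (1, 0); with these
# gains the relative-value equation (2) holds e.g. with v = (0, 0).
print(check_functional_equations(g=[1.0, 0.0], v=[0.0, 0.0]))  # True
print(check_functional_equations(g=[0.0, 0.0], v=[0.0, 0.0]))  # False
```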

Keywords

Decision Rule · Markov Decision Process · Reward Function · Optimality Equation · Gain Rate

Copyright information

© Springer Science+Business Media New York 1986

Authors and Affiliations

  • Elke Mann
    1. Institut für Angewandte Mathematik, Universität Bonn, Bonn, Germany
