Abstract
This paper presents a particular case of a Bayesian dynamic controlled model, namely an infinite-horizon partially observed one, in which the core process is a controlled finite-state, discrete-time semi-Markov process.
We assume that control resets and noise-corrupted observations of the core process occur at the transition times of the core process. The control applied at each transition time may depend functionally on the sample path of the core process only through the history of the corrupted observations.
Based on the construction of a sufficient statistic, the considered controlled models with incomplete information were reduced to controlled models with complete state-information, and conditions for the average cost optimality criterion were stated in Drăguţ (1983).
In order to outline algorithms to calculate the optimal control policy and optimal payoff functions, new conditions must be imposed on the state structure of the partially observed semi-Markov process.
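In the discrete-time finite-state setting, the sufficient statistic underlying the reduction described above is the Bayesian posterior (belief) over the core-process states, updated at each transition from the chosen control and the corrupted observation. A minimal sketch of one such update step, with all matrices and names hypothetical rather than taken from the paper:

```python
import numpy as np

def belief_update(belief, T, O, action, obs):
    """One Bayesian filter step: posterior over core-process states
    given the latest control and noisy observation. This posterior is
    the sufficient statistic that turns the incomplete-information
    problem into one with a completely observed (belief) state."""
    # Predict: push the current belief through the controlled transition kernel.
    predicted = belief @ T[action]                 # shape (n_states,)
    # Correct: weight each state by the likelihood of the observation received.
    unnormalized = predicted * O[action][:, obs]
    return unnormalized / unnormalized.sum()

# Hypothetical example: two hidden states, two controls, two observation symbols.
T = np.array([[[0.9, 0.1], [0.2, 0.8]],    # T[a][s, s'] transition matrices
              [[0.5, 0.5], [0.4, 0.6]]])
O = np.array([[[0.8, 0.2], [0.3, 0.7]],    # O[a][s', o] observation likelihoods
              [[0.9, 0.1], [0.2, 0.8]]])
b = np.array([0.5, 0.5])                   # uniform prior belief
b = belief_update(b, T, O, action=0, obs=1)
```

A control policy for the reduced problem is then a map from this belief vector to actions, which is what makes the average-cost dynamic programming machinery of the completely observed case applicable.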
References
Drăguţ M. (1981): Non-discounted semi-Markov decision processes with incomplete state-information. In: Proc. of the Sixth Brasov Conference, 1979, Publishing House of the Academy of RSR, 403–411.
Drăguţ M. (1983): On semi-Markov decision processes with incomplete state-information. Doctoral Thesis, Centre of Mathematical Statistics, Bucharest, Romania.
Hinderer K. (1970): Foundations of non-stationary dynamic programming with discrete time parameter. In: Lecture Notes in Oper. Research and Mathem. Systems, vol. 33, Springer-Verlag.
Iosifescu M.: On invariant densities for piecewise monotonic transformation. To be published.
Kurano M. (1976): On the existence of an optimal stationary I-policy in non-discounted Markovian decision process with incomplete state-information. Bull. Math. Statist. 17, No. 3–4, 75–81.
Rhenius D. (1974): Incomplete information in Markov decision models. Ann. Statist. 2, No. 2, 1327–1334.
Ross S.M. (1970): Average cost semi-Markov decision processes. J. Appl. Probab. 7, No. 3, 649–656.
Sondik E. (1978): The optimal control of partially observable Markov processes over the infinite horizon: discounted costs. Oper. Res. 26, No. 2, 282–304.
Tijms H.C.: On dynamic programming with arbitrary state space, compact action space and the average return as criterion. To be published.
White C.C. (1976): Procedures for the solution of a finite-horizon partially observed semi-Markov optimization problem. Oper. Res. 24, No. 2, 348–358.
Copyright information
© 1988 Academia, Publishing House of the Czechoslovak Academy of Sciences, Prague
Drăguţ, M. (1988). Sufficient Optimality Conditions for Semi-Markov Decision Processes with Incomplete State-Information: Undiscounted Case. In: Višek, J.Á. (eds) Transactions of the Tenth Prague Conference. Czechoslovak Academy of Sciences, vol 10A-B. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-3859-5_26
Print ISBN: 978-94-010-8216-7
Online ISBN: 978-94-009-3859-5