Abstract
This paper is devoted to the generalization of the numerical procedure for the finite-horizon partially observed semi-Markov optimization control problem due to White (1976) and also of the numerical procedure for the infinite horizon Markov optimization problem due to Sondik (1977)
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Denardo E. (1967): Contraction mappings in the theory underlying dynamic programming. In SIAM Review, 9, 165–177
Howard R. (1971): Dynamic probabilistic systems. John Wiley
Lippman S. (1975): On dynamic programming with unbounded reward. In Management Science, vol.21, no.11, 1225–1232
Sondik E. (1978)The optimal control of partially observable Markov processes over the infinite horizon-discounted costs. In: Operations Research, vol.26, no.2, 282–303
White C. (1976): Procedures for the solution of a finite-horizon partially observed semi-Markov optimization problem. In: Operations Research, vol.24, no.2, 348–357
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1983 ACADEMIA, Publishing House of the Czechoslovak Academy of Sciences, Prague
About this chapter
Cite this chapter
Drăgţ, M. (1983). The Optimal Control of Partially Observable Semi-Markov Processes Over the Infinite Horizon: Discounted Costs. In: Transactions of the Ninth Prague Conference. Czechoslovak Academy of Sciences, vol 9A. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-7013-7_22
Download citation
DOI: https://doi.org/10.1007/978-94-009-7013-7_22
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-009-7015-1
Online ISBN: 978-94-009-7013-7
eBook Packages: Springer Book Archive