The first arrival model of continuous time Markovian decision programming — The discounted rate is 0

Wang, Ping

doi:10.1007/BF03167366

The first arrival model of continuous time Markovian decision programming — The discounted rate is 0

Published: October 1999

Volume 16, pages 423–430, (1999)
Cite this article

Japan Journal of Industrial and Applied Mathematics Aims and scope Submit manuscript

Ping Wang¹

23 Accesses
Explore all metrics

Abstract

This is the first paper studying the CTMDP model without any discounted factor: the first arrival at target set. It gives conditions not too strong which can establish the foundation of the problem, then studies the existence and the form of the solution to the optimal equation, at last shows a method to search for the optimal policy. These results are important in both the theory and application of Markovian decision programming.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Discrete-time control with non-constant discount factor

Article 27 June 2020

A Constrained Optimization Problem with Applications to Constrained MDPs

Continuous-Time Markov Decision Processes Under the Risk-Sensitive First Passage Discounted Cost Criterion

Article 06 March 2023

References

D. Blackwell, Discounted dynamic programming. Ann Math. Statist.,36 (1965), 226–235.
Article MATH MathSciNet Google Scholar
K.L. Chung, Markov Chains with Stationary Transition Probability. Springer-Verlag, Berlin, 1967.
Google Scholar
S.C. Jaquette, Markov decision progress with a new optimality criterion; small interest rates. Ann. Math. Statist.,43 (1972), 1894–1901.
Article MATH MathSciNet Google Scholar
P. Kakumanu, Continuously discounted Markov decision model with countable state and action space. Ann. Math. Statist.,42 (1971), 919–926.
Article MATH MathSciNet Google Scholar
Lin Yuanlie, Continuous time model on the first arrival (I)-optimal in discounted moments. Acta. Appl. Math.,14 (1991), 115–124. (in Chinese)
MATH Google Scholar
N.J. Pullman, Matrix Theory and Its Applications. Marcel Dekkor Inc., New York, 1976.
MATH Google Scholar
K. Tanaka and C. Matsuda, On a continuously discounted vector valued Markov decision progress. J. Inform. Optim. Sci.,11 (1) (1990), 33–48.
MATH MathSciNet Google Scholar
Chengxi Zhu, The distributions of integral functionals of inhomogeneous Markov chains. Acta. Math. Sinica.,29 (3) (1986), 338–346.
MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Department of Finance, China Institute of Finance, 100029, Beijing, P.R. China
Ping Wang

Authors

Ping Wang
View author publications
You can also search for this author in PubMed Google Scholar

About this article

Cite this article

Wang, P. The first arrival model of continuous time Markovian decision programming — The discounted rate is 0. Japan J. Indust. Appl. Math. 16, 423–430 (1999). https://doi.org/10.1007/BF03167366

Download citation

Received: 20 March 1995
Revised: 12 October 1998
Issue Date: October 1999
DOI: https://doi.org/10.1007/BF03167366

Key words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The first arrival model of continuous time Markovian decision programming — The discounted rate is 0

Abstract

Access this article

Similar content being viewed by others

Discrete-time control with non-constant discount factor

A Constrained Optimization Problem with Applications to Constrained MDPs

Continuous-Time Markov Decision Processes Under the Risk-Sensitive First Passage Discounted Cost Criterion

References

Author information

Authors and Affiliations

About this article

Cite this article

Key words

Navigation

The first arrival model of continuous time Markovian decision programming — The discounted rate is 0

Abstract

Access this article

Similar content being viewed by others

Discrete-time control with non-constant discount factor

A Constrained Optimization Problem with Applications to Constrained MDPs

Continuous-Time Markov Decision Processes Under the Risk-Sensitive First Passage Discounted Cost Criterion

References

Author information

Authors and Affiliations

About this article

Cite this article

Share this article

Key words

Search

Navigation