
Markov decision processes

Reference work entry in the Encyclopedia of Operations Research and Management Science.

The finite-state, finite-action Markov decision process is a particularly simple and relatively tractable model of sequential decision making under uncertainty. It has been applied in fields as diverse as health care, highway maintenance, inventory control, machine maintenance, cash-flow management, and the regulation of water reservoir capacity (Derman, 1970; Hernández-Lerma, 1989; Ross, 1970; White, 1969). Here we define the Markov decision process and illustrate it with an example, followed by a discussion of solution procedures for several different types of Markov decision processes, all of which are based on dynamic programming (Bertsekas, 1987; Howard, 1971; Puterman, 1994; Sennott, 1999).

PROBLEM FORMULATION

Let k ∈ {0, 1, ..., K − 1} denote the kth stage or decision epoch, that is, the point at which the kth decision must be selected; K < ∞ is the planning horizon of the Markov decision process. Let s_k be the state of the system to be controlled at stage k....
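Although the preview cuts off before the full formulation, the finite-horizon model just introduced is conventionally solved by backward induction, the dynamic-programming recursion underlying the solution procedures the entry goes on to discuss. The following sketch shows that recursion on a hypothetical two-state machine-maintenance instance; the states, actions, rewards, and transition probabilities are invented for illustration and are not from the entry.

```python
# Minimal sketch of backward induction (dynamic programming) for a
# finite-state, finite-action, finite-horizon MDP. The two-state
# machine-maintenance instance below is hypothetical: its rewards and
# transition probabilities are invented for illustration.

def backward_induction(P, r, K):
    """Solve a K-stage MDP by recursing backward from the horizon.

    P[a][s][t] -- probability of moving from state s to state t under action a
    r[a][s]    -- one-stage reward for choosing action a in state s
    Returns (values, policies), where values[k][s] is the maximal expected
    total reward from stage k onward and policies[k][s] a maximizing action.
    """
    n_actions, n_states = len(P), len(P[0])
    values = [[0.0] * n_states for _ in range(K + 1)]   # terminal values are 0
    policies = [[0] * n_states for _ in range(K)]
    for k in range(K - 1, -1, -1):                      # stages K-1, ..., 0
        for s in range(n_states):
            # Q(a) = immediate reward + expected value-to-go at stage k+1
            q = [r[a][s] + sum(P[a][s][t] * values[k + 1][t]
                               for t in range(n_states))
                 for a in range(n_actions)]
            best = max(range(n_actions), key=q.__getitem__)
            values[k][s], policies[k][s] = q[best], best
    return values, policies

# Hypothetical instance: state 0 = machine working, state 1 = machine broken;
# action 0 = "operate", action 1 = "repair".
P = [
    [[0.8, 0.2], [0.0, 1.0]],   # operate: a working machine may break down
    [[1.0, 0.0], [0.9, 0.1]],   # repair: a broken machine is usually restored
]
r = [
    [10.0, 0.0],                # operating earns only while the machine works
    [5.0, -4.0],                # repairing a broken machine costs money
]
values, policies = backward_induction(P, r, K=5)
print(values[0], policies[0])
```

On this instance the computed policy is nonstationary in the way one would expect: with five stages to go it repairs a broken machine, but at the final stage the repair cost can no longer be recouped, so it does not.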


References

  1. Bertsekas, D.P. (1987). Dynamic Programming: Deterministic and Stochastic Models, Wiley-Interscience, New York.
  2. Derman, C. (1970). Finite State Markovian Decision Processes, Academic Press, New York.
  3. Hernández-Lerma, O. (1989). Adaptive Markov Control Processes, Springer-Verlag, New York.
  4. Howard, R. (1971). Dynamic Programming and Markov Processes, MIT Press, Cambridge, Massachusetts.
  5. Puterman, M.L. (1994). Markov Decision Processes: Discrete Stochastic Dynamic Programming, Wiley-Interscience, New York.
  6. Ross, S.M. (1970). Applied Probability Models with Optimization Applications, Holden-Day, San Francisco.
  7. Sennott, L.I. (1999). Stochastic Dynamic Programming and the Control of Queueing Systems, John Wiley, New York.
  8. White, D.J. (1969). Markov Decision Processes, John Wiley, Chichester, UK.


Copyright information

© 2001 Kluwer Academic Publishers

About this entry

Cite this entry

White, C.C. (2001). Markov decision processes. In: Gass, S.I., Harris, C.M. (eds.) Encyclopedia of Operations Research and Management Science. Springer, New York, NY. https://doi.org/10.1007/1-4020-0611-X_580


  • Print ISBN: 978-0-7923-7827-3

  • Online ISBN: 978-1-4020-0611-1
