Markov decision processes (MDPs), also called stochastic dynamic programming, have been studied extensively since they were first introduced in 1960 [55]. MDPs were mainly used to model and solve dynamic decision-making problems with multi-periods under stochastic circumstances.


Discount Factor Markov Decision Process Reserve Price Reward Function Supervisory Control 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer Science+Business Media, LLC 2008

Personalised recommendations