Control Optimization with Stochastic Dynamic Programming
This chapter focuses on a problem of control optimization — namely, the Markov decision problem. Our discussions will be at a very elementary level, and we will not attempt to prove any theorems.
KeywordsTransition Probability Matrix Stochastic Game Bellman Equation Policy Iteration Average Reward
Unable to display preview. Download preview PDF.