Abstract
In this chapter, we will discuss an alternative to Reinforcement Learning for solving Markov decision problems (MDPs) and Semi-Markov decision problems (SMDPs). The methodology that we will discuss in this chapter is generally referred to as Learning Automata. We have already discussed the theory of learning automata in the context of parametric optimization. It turns out that in control optimization too, in particular for solving problems modeled with Markov chains, learning automata methods can be useful.
If a man does not keep pace with his companion, perhaps it is because he hears a different drummer. Let him step to the music which he hears, however measured and far away.
— H.D. Thoreau (1817–1862)
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer Science+Business Media New York
About this chapter
Cite this chapter
Gosavi, A. (2003). Control Optimization with Learning Automata. In: Simulation-Based Optimization. Operations Research/Computer Science Interfaces Series, vol 25. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3766-0_10
Download citation
DOI: https://doi.org/10.1007/978-1-4757-3766-0_10
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5354-4
Online ISBN: 978-1-4757-3766-0
eBook Packages: Springer Book Archive