Control Optimization with Learning Automata

Gosavi, Abhijit

doi:10.1007/978-1-4757-3766-0_10

Abhijit Gosavi⁴

Part of the book series: Operations Research/Computer Science Interfaces Series ((ORCS,volume 25))

1175 Accesses

Abstract

In this chapter, we will discuss an alternative to Reinforcement Learning for solving Markov decision problems (MDPs) and Semi-Markov decision problems (SMDPs). The methodology that we will discuss in this chapter is generally referred to as Learning Automata. We have already discussed the theory of learning automata in the context of parametric optimization. It turns out that in control optimization too, in particular for solving problems modeled with Markov chains, learning automata methods can be useful.

If a man does not keep pace with his companion, perhaps it is because he hears a different drummer. Let him step to the music which he hears, however measured and far away.

— H.D. Thoreau (1817–1862)

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

Department of Industrial Engineering, The State University of New York, Buffalo, New York, USA
Abhijit Gosavi

Authors

Abhijit Gosavi
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Gosavi, A. (2003). Control Optimization with Learning Automata. In: Simulation-Based Optimization. Operations Research/Computer Science Interfaces Series, vol 25. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3766-0_10

Download citation

DOI: https://doi.org/10.1007/978-1-4757-3766-0_10
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5354-4
Online ISBN: 978-1-4757-3766-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics