Cooperativity in Networks of Pattern Recognizing Stochastic Learning Automata

Barto, Andrew G.; Anandan, P.; Anderson, Charles W.

doi:10.1007/978-1-4757-1895-9_16

Abstract

A class of learning tasks is described that combines aspects of learning automaton tasks and supervised learning pattern-classification tasks. We call these associative reinforcement learning tasks. An algorithm is presented, called the associative reward-penalty, or A_R−P, algorithm, for which a form of optimal performance has been proved. This algorithm simultaneously generalizes a class of stochastic learning automata and a class of supervised learning pattern-classification methods. Simulation results are presented that illustrate the associative reinforcement learning task and the performance of the A_R−P algorithm. Additional simulation results are presented showing how cooperative activity in networks of interconnected A_R−P automata can solve difficult nonlinear associative learning problems.
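The abstract names the algorithm but does not state its update rule, so the following is only an illustrative sketch of a single associative reward-penalty unit, written in the form in which the rule is commonly stated: a linear unit with logistic noise that emits +1 or −1, moves toward its action after a reward, and moves (more weakly) toward the opposite action after a penalty. The class name `ARPUnit`, the helper `train_demo`, the parameters `rho` and `lam`, and the reward probabilities of the toy environment are all assumptions made for illustration; none of them comes from this page.

```python
import math
import random


def logistic(s):
    """Logistic function; here the probability that the unit emits +1."""
    return 1.0 / (1.0 + math.exp(-s))


class ARPUnit:
    """One stochastic unit with a reward-penalty style update (illustrative)."""

    def __init__(self, n_inputs, rho=0.1, lam=0.05, rng=None):
        self.theta = [0.0] * n_inputs   # weight vector
        self.rho = rho                  # learning rate (assumed name)
        self.lam = lam                  # penalty scaling, 0 < lam < 1 (assumed name)
        self.rng = rng or random.Random()

    def prob_plus(self, x):
        """P(y = +1 | x) under logistic noise added to the weighted sum."""
        s = sum(t * xi for t, xi in zip(self.theta, x))
        return logistic(s)

    def act(self, x):
        """Sample an action in {+1, -1} for context pattern x."""
        return 1 if self.rng.random() < self.prob_plus(x) else -1

    def learn(self, x, y, rewarded):
        """Update weights from the scalar reward/penalty signal."""
        expected = 2.0 * self.prob_plus(x) - 1.0    # E[y | x]
        if rewarded:
            delta = self.rho * (y - expected)              # move toward y
        else:
            delta = self.lam * self.rho * (-y - expected)  # move toward -y
        self.theta = [t + delta * xi for t, xi in zip(self.theta, x)]


def train_demo(steps=4000, seed=0):
    """Toy associative task: the better action depends on the input pattern."""
    rng = random.Random(seed)
    unit = ARPUnit(n_inputs=2, rho=0.1, lam=0.05, rng=rng)
    patterns = [(1.0, 0.0), (0.0, 1.0)]
    # Hypothetical reward probabilities P(reward | pattern index, action).
    reward_prob = {(0, 1): 0.9, (0, -1): 0.1, (1, 1): 0.1, (1, -1): 0.9}
    for _ in range(steps):
        k = rng.randrange(2)
        x = patterns[k]
        y = unit.act(x)
        rewarded = rng.random() < reward_prob[(k, y)]
        unit.learn(x, y, rewarded)
    return unit


if __name__ == "__main__":
    unit = train_demo()
    print("P(+1 | pattern 0) =", unit.prob_plus((1.0, 0.0)))
    print("P(+1 | pattern 1) =", unit.prob_plus((0.0, 1.0)))
```

In this toy setting the unit receives only a success/failure signal, never the correct label, yet it should come to prefer +1 for the first pattern and −1 for the second. Setting `lam` to zero would give a reward-only variant of the same sketch.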

Author information

Authors and Affiliations

Department of Computer and Information Science, University of Massachusetts, Amherst, MA, 01003, USA
Andrew G. Barto, P. Anandan & Charles W. Anderson

Editor information

Editors and Affiliations

Center for Systems Science, Yale University, New Haven, Connecticut, USA
Kumpati S. Narendra

About this chapter

Cite this chapter

Barto, A.G., Anandan, P., Anderson, C.W. (1986). Cooperativity in Networks of Pattern Recognizing Stochastic Learning Automata. In: Narendra, K.S. (eds) Adaptive and Learning Systems. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-1895-9_16

DOI: https://doi.org/10.1007/978-1-4757-1895-9_16
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-1897-3
Online ISBN: 978-1-4757-1895-9
eBook Packages: Springer Book Archive
