Stabilising Hebbian Learning with a Third Factor in a Food Retrieval Task

Thompson, Adedoyin Maria; Porr, Bernd; Wörgötter, Florentin

doi:10.1007/11840541_26

Adedoyin Maria Thompson²⁵,
Bernd Porr²⁵ &
Florentin Wörgötter²⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4095))

Included in the following conference series:

International Conference on Simulation of Adaptive Behavior

1694 Accesses
1 Citations

Abstract

When neurons fire together they wire together. This is Donald Hebb’s famous postulate. However, Hebbian learning is inherently unstable because synaptic weights will self amplify themselves: the more a synapse is able to drive a postsynaptic cell the more the synaptic weight will grow. We present a new biologically realistic way how to stabilise synaptic weights by introducing a third factor which switches on or off learning so that self amplification is minimised. The third factor can be identified by the activity of dopaminergic neurons in VTA which fire when a reward has been encountered. This leads to a new interpretation of the dopamine signal which goes beyond the classical prediction error hypothesis. The model is tested by a real world task where a robot has to find “food disks” in an environment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hebb, D.O.: The organization of behavior: A neurophychological study. Wiley-Interscience, New York (1949)
Google Scholar
Kosco, B.: Differential hebbian learning. In: Denker, J.S. (ed.) Neural Networks for computing. AIP conference proceedings, Snowbird, Utah, vol. 151, pp. 277–282. American Institute of Physics, New York (1986)
Google Scholar
Porr, B., Wörgötter, F.: Isotropic Sequence Order learning. Neural Comp. 15, 831–864 (2003)
Article MATH Google Scholar
Bailey, C.H., Giustetto, M., Huang, Y.Y., Hawkins, R.D., Kandel, E.R.: Is heterosynaptic modulation essential for stabilizing Hebbian plasticity and memory? Nat. Rev. Neurosci. 1(1), 11–20 (2000)
Article Google Scholar
Grossberg, S., Schmajuk, N.: Neural dynamics of adaptive timing and temporal discrimination during associative learning. Neural Networks 2, 79–102 (1989)
Article Google Scholar
Verschure, P.F.M.J., Voegtlin, T., Douglas, R.J.: Environmentally mediated synergy between perception and behaviour in mobile robots. Nature 425, 620–624 (2003)
Article Google Scholar
Porr, B., Wörgötter, F.: Isotropic sequence order learning in a closed loop behavioural system. Roy. Soc. Phil. Trans. Math., Phys. & Eng. Sciences 361(1811), 2225–2244 (2003)
Article Google Scholar
Ziemke, T.: Are robots embodied? In: First international workshop on epigenetic robotics Modeling Cognitive Development in Robotic Systems, Lund, vol. 85 (2001)
Google Scholar
Sutton, R.: Learning to predict by method of temporal differences. Machine Learning 3(1), 9–44 (1988)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, 2002th edn. Bradford Books, MIT Press (1998)
Google Scholar
Markram, H., Lübke, J., Frotscher, M., Sakman, B.: Regulation of synaptic efficacy by coincidence of postsynaptic aps and epsps. Science 275, 213–215 (1997)
Article Google Scholar
Saudargiene, A., Porr, B., Wörgötter, F.: How the shape of pre- and postsynaptic signals can influence stdp: A biophysical model. Neural Comp. 16, 595–626 (2004)
Article MATH Google Scholar
Centonze, D., Picconi, B., Gubellini, P., Bernardi, G., Calabresi, P.: Dopaminergic control of synaptic plasticity in the dorsal striatum. Eur. J. Neurosci. 13(6), 1071–1077 (2001)
Article Google Scholar
Reynolds, J.N., Wickens, J.R.: Dopamine dependent plasticity of corticostriatal synapses. Neural Networks 15, 507–521 (2002)
Article Google Scholar
Wörgötter, F., Porr, B.: Temporal sequence learning, prediction and control - a review of different models and their relation to biological mechanisms. Neural Comp. 17, 245–319 (2005)
Article Google Scholar
Zhang, L.I., Tao, H.W., Holt, C.E., Harris, W.A., Poo, M.: A critical window for cooperation and competition among developing retinotectal synapses. Nature 395, 37–44 (1998)
Article Google Scholar
Schultz, W.: Dopamine neurons and their role in reward mechanisms. Curr. Opin. Neurobiol. 7(2), 191–197 (1997)
Article Google Scholar
Prescott, T.J., González, F.M.M., Gurney, K., Humpries, M.D., Redgrave, P.: A robot model of the basal ganglia: Behaviour and intrinsic processing. Neural Networks (in press, 2006)
Google Scholar
Dayan, P., Balleine, B.W.: Reward, motivation and reinforcement learning. Neuron 36, 285–298 (2002)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronics & Electrical Engineering, University of Glasgow, G12 8LT, Glasgow, Scotland, United Kingdom
Adedoyin Maria Thompson & Bernd Porr
Bernstein Center of Computational Neuroscience, University Göttingen, Germany
Florentin Wörgötter

Authors

Adedoyin Maria Thompson
View author publications
You can also search for this author in PubMed Google Scholar
Bernd Porr
View author publications
You can also search for this author in PubMed Google Scholar
Florentin Wörgötter
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Cognitive Sciences and Technologies, LARRAL, Via S. Martino della Battaglia 44, 00185, Roma, Italy
Stefano Nolfi
Institute of Cognitive Science and Technology, (ISTC-CNR), Via San Martino della Battaglia 44, 00185, Rome, Italy
Gianluca Baldassarre
Institute of Cognitive Science and Technology, ISTC-CNR, Via S. Martino della Battaglia 44, 00185, Rome, Italy
Raffaele Calabretta & Davide Marocco &
The Mærsk Mc-Kinney Møller Institute, University of Southern Denmark, Campusvej 55, 5230, Odense M, Denmark
John C. T. Hallam
UPMC Univ Paris 6, FRE2507, ISIR, F-75016, Paris, France
Jean-Arcady Meyer
Laboratory of Autonomous Robotics and Artificial Life, Institute of Cognitive, Sciences and Technologies, National Research Council, Rome, Italy
Orazio Miglino
Institute of Cognitive Sciences and Technologies, National Research Council, 44, Via San Martino della, 00185, Rome, Battaglia, Italy
Domenico Parisi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Thompson, A.M., Porr, B., Wörgötter, F. (2006). Stabilising Hebbian Learning with a Third Factor in a Food Retrieval Task. In: Nolfi, S., et al. From Animals to Animats 9. SAB 2006. Lecture Notes in Computer Science(), vol 4095. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11840541_26

Download citation

DOI: https://doi.org/10.1007/11840541_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-38608-7
Online ISBN: 978-3-540-38615-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics