Supervised learning in spiking neurons
- 768 Downloads
KeywordsSupport Vector Machine Conditioned Stimulus Unconditioned Stimulus Supervise Learning Classical Conditioning
A critical function of the brain is to learn behavioral responses appropriate to given sensory stimuli. In some cases, the appropriate response to a stimulus is explicitly provided. For example in classical conditioning, after pairing of an unconditioned stimulus that naturally drives a certain behavioral response with a second conditioned stimulus that is originally neutral, the animal learns to later respond to produce the behavior in response to the conditioned stimulus.
In statistics and machine learning, the problem of associating a particular class of inputs with an externally specified target output is called "supervised learning". Various statistical techniques have been developed for supervised learning, for example the support vector machine (SVM) which has seen success in a wide variety of applications. However, the way supervised learning could occur in spiking neurons remains unclear. One potential answer to this question is provided by an algorithm termed the "Tempotron", which has been successfully applied to real-world classification, but contains features unlikely to occur in real neurons: neurons are required to fire exactly one spike when a particular category is detected; furthermore the algorithm can only work reliably if input patterns are perfectly normalized.
We developed a robust algorithm for supervised learning of spatio-temporal patterns by spiking neurons receiving inputs through both excitatory and inhibitory synapses. By extending concepts from the SVM framework to spiking neurons, we generalized the Tempotron rule to mathematically derive a learning rule for classifying input patterns. In this framework, each target category is represented by a pool of output neurons that are required to together fire several spikes in response to a correct pattern, but not to fire otherwise. Performance is assessed by a hinge-loss error function on the summed activity of the pool. We suggest a candidate cellular mechanism for the rule, in which a training signal arriving when behavior is performed operates to consolidate an eligibility trace left by coincidence of presynaptic firing and postsynaptic depolarization.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.