Operant matching is a generic outcome of synaptic plasticity based on the covariance between reward and neural activity

10 October 2006

journal article
Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences

Vol. 103 (41) , 15224-15229
https://doi.org/10.1073/pnas.0505220103

Abstract

The probability of choosing an alternative in a long sequence of repeated choices is proportional to the total reward derived from that alternative, a phenomenon known as Herrnstein's matching law. This behavior is remarkably conserved across species and experimental conditions, but its underlying neural mechanisms still are unknown. Here, we propose a neural explanation of this empirical law of behavior. We hypothesize that there are forms of synaptic plasticity driven by the covariance between reward and neural activity and prove mathematically that matching is a generic outcome of such plasticity. Two hypothetical types of synaptic plasticity, embedded in decision-making neural network models, are shown to yield matching behavior in numerical simulations, in accord with our general theorem. We show how this class of models can be tested experimentally by making reward not only contingent on the choices of the subject but also directly contingent on fluctuations in neural activity. Maximization is shown to be a generic outcome of synaptic plasticity driven by the sum of the covariances between reward and all past neural activities.

Keywords

This publication has 30 references indexed in Scilit:

A Biophysically Based Neural Model of Matching Law Behavior: Melioration by Stochastic Synapses
Journal of Neuroscience, 2006
Indeterminacy in Brain and Behavior
Annual Review of Psychology, 2005
Matching Behavior and the Representation of Value in the Parietal Cortex
Science, 2004
Prefrontal cortex and decision making in a mixed-strategy game
Nature Neuroscience, 2004
Deterministic Approximation of Stochastic Evolution in Games
Econometrica, 2003
Direct Cortical Control of 3D Neuroprosthetic Devices
Science, 2002
A re‐examination of probability matching and rational choice
Journal of Behavioral Decision Making, 2002
The rat approximates an ideal detector of changes in rates of reward: Implications for the law of effect.
Journal of Experimental Psychology: Animal Behavior Processes, 2001
A Stochastic Learning Model of Economic Behavior
The Quarterly Journal of Economics, 1973
Operant Conditioning of Cortical Unit Activity
Science, 1969