Operant matching is a generic outcome of synaptic plasticity based on the covariance between reward and neural activity
- 10 October 2006
- journal article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 103 (41) , 15224-15229
- https://doi.org/10.1073/pnas.0505220103
Abstract
The probability of choosing an alternative in a long sequence of repeated choices is proportional to the total reward derived from that alternative, a phenomenon known as Herrnstein's matching law. This behavior is remarkably conserved across species and experimental conditions, but its underlying neural mechanisms still are unknown. Here, we propose a neural explanation of this empirical law of behavior. We hypothesize that there are forms of synaptic plasticity driven by the covariance between reward and neural activity and prove mathematically that matching is a generic outcome of such plasticity. Two hypothetical types of synaptic plasticity, embedded in decision-making neural network models, are shown to yield matching behavior in numerical simulations, in accord with our general theorem. We show how this class of models can be tested experimentally by making reward not only contingent on the choices of the subject but also directly contingent on fluctuations in neural activity. Maximization is shown to be a generic outcome of synaptic plasticity driven by the sum of the covariances between reward and all past neural activities.Keywords
This publication has 30 references indexed in Scilit:
- A Biophysically Based Neural Model of Matching Law Behavior: Melioration by Stochastic SynapsesJournal of Neuroscience, 2006
- Indeterminacy in Brain and BehaviorAnnual Review of Psychology, 2005
- Matching Behavior and the Representation of Value in the Parietal CortexScience, 2004
- Prefrontal cortex and decision making in a mixed-strategy gameNature Neuroscience, 2004
- Deterministic Approximation of Stochastic Evolution in GamesEconometrica, 2003
- Direct Cortical Control of 3D Neuroprosthetic DevicesScience, 2002
- A re‐examination of probability matching and rational choiceJournal of Behavioral Decision Making, 2002
- The rat approximates an ideal detector of changes in rates of reward: Implications for the law of effect.Journal of Experimental Psychology: Animal Behavior Processes, 2001
- A Stochastic Learning Model of Economic BehaviorThe Quarterly Journal of Economics, 1973
- Operant Conditioning of Cortical Unit ActivityScience, 1969