Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans

Top Cited Papers

23 August 2006

journal article
research article
Published by Springer Nature in Nature

Vol. 442 (7106) , 1042-1045
https://doi.org/10.1038/nature05051

Abstract

The brain messenger dopamine is traditionally known as the 'pleasure molecule', linked with our desire for food and sex, as well as drug and gambling addictions. The precise function of dopamine in humans has remained elusive, and theories have relied almost exclusively on animal experiments. Using brain imaging technology, Pessiglione et al. scanned healthy human volunteers as they gambled for money after taking drugs that interfere with dopamine signals. Volunteers with boosted dopamine became better gamblers than their dopamine-suppressed counterparts. When dopamine levels were either enhanced or reduced by drugs, the scans showed that both reward-related learning and associated striatal activity are modulated, confirming the critical role of dopamine in integrating reward information for generation future decisions. An fMRI study of healthy human volunteers finds that when dopamine levels are either enhanced or reduced by drugs, both reward-related learning and associated striatal activity are modulated, confirming the critical role of dopamine in integrating reward information for future decisions. Theories of instrumental learning are centred on understanding how success and failure are used to improve future decisions1. These theories highlight a central role for reward prediction errors in updating the values associated with available actions2. In animals, substantial evidence indicates that the neurotransmitter dopamine might have a key function in this type of learning, through its ability to modulate cortico-striatal synaptic efficacy3. However, no direct evidence links dopamine, striatal activity and behavioural choice in humans. Here we show that, during instrumental learning, the magnitude of reward prediction error expressed in the striatum is modulated by the administration of drugs enhancing (3,4-dihydroxy-l-phenylalanine; l-DOPA) or reducing (haloperidol) dopaminergic function. Accordingly, subjects treated with l-DOPA have a greater propensity to choose the most rewarding action relative to subjects treated with haloperidol. Furthermore, incorporating the magnitude of the prediction errors into a standard action-value learning algorithm accurately reproduced subjects' behavioural choices under the different drug conditions. We conclude that dopamine-dependent modulation of striatal activity can account for how the human brain uses reward prediction errors to improve future decisions.

Keywords

This publication has 29 references indexed in Scilit:

Representation of Action-Specific Reward Values in the Striatum
Science, 2005
Motor control in basal ganglia circuits using fMRI and brain atlas approaches
Cerebral Cortex, 2005
By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism
Science, 2004
Dopamine, learning and motivation
Nature Reviews Neuroscience, 2004
Dissociable Roles of Ventral and Dorsal Striatum in Instrumental Conditioning
Science, 2004
A Neural Substrate of Prediction and Reward
Science, 1997
The involvement of nucleus accumbens dopamine in appetitive and aversive motivation
Behavioural Brain Research, 1994
The neural network of the basal ganglia as revealed by the study of synaptic connections of identified neurones
Trends in Neurosciences, 1990
Parallel Organization of Functionally Segregated Circuits Linking Basal Ganglia and Cortex
Annual Review of Neuroscience, 1986
Contemporary Animal Learning Theory
The American Journal of Psychology, 1982