A framework for studying the neurobiology of value-based decision making

Abstract
Most behavioural and computational models of decision making assume that the following five processes are carried out at the time the decision is made: representation, action valuation, action selection, outcome valuation, and learning. On the basis of a sizeable body of animal and human behavioural evidence, several groups have proposed the existence of three different types of valuation systems: Pavlovian, habitual and goal-directed systems. Pavlovian systems assign value to only a small set of 'prepared' behaviours and thus have a limited behavioural repertoire. Nevertheless, they might be the driving force behind behaviours with important economic consequences (for example, overeating). Examples include preparatory behaviours, such as approaching a cue that predicts food, and consummatory behaviours, such as ingesting available food. Habitual systems learn to assign values to stimulus–response associations on the basis of previous experience, through a process of trial and error. Examples of habits include a smoker's desire to have a cigarette at particular times of day (for example, after a meal) and a rat's tendency to forage in a cue-dependent location after sufficient training. Goal-directed systems assign values to actions by computing action–outcome associations and then evaluating the rewards that are associated with the different outcomes. An example of a goal-directed behaviour is deciding what to eat at a new restaurant. An important difference between habitual and goal-directed systems lies in how they respond to changes in the environment: the goal-directed system updates the value of an action as soon as the value of its outcome changes, whereas the habit system learns only with repeated experience. The values computed by the three systems can be modulated by factors such as the risk associated with the decision, the time delay to the outcomes, and social considerations. The quality of the decisions made by an animal depends on how its brain assigns control to the different valuation systems when the animal has to choose between several potential actions that are assigned conflicting values. The learning properties of the habit system seem to be well described by simple reinforcement-learning algorithms, such as Q-learning. Some of the key computations that are predicted by these models are instantiated in the dopamine system.
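The abstract's closing point, that habit learning is well described by simple reinforcement-learning algorithms such as Q-learning, with key computations (the reward prediction error) thought to be instantiated in the dopamine system, can be illustrated with a minimal sketch. The code below implements the stateless (bandit) special case of the Q-learning update on a toy two-action task; the task, reward probabilities, and parameter values are hypothetical assumptions introduced for illustration and are not taken from the article.

```python
import random

# Minimal sketch of incremental value learning (stateless Q-learning / delta rule).
# The two-action task, reward probabilities and parameters below are illustrative
# assumptions, not details from the article.

ACTIONS = ["press_lever", "pull_chain"]
TRUE_REWARD_PROB = {"press_lever": 0.8, "pull_chain": 0.2}  # hypothetical environment

ALPHA = 0.1    # learning rate
EPSILON = 0.1  # exploration rate for epsilon-greedy action selection

Q = {a: 0.0 for a in ACTIONS}  # cached ("habit-like") action values

def choose_action():
    """Epsilon-greedy selection over the cached action values."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[a])

for trial in range(1000):
    action = choose_action()
    # Sample a binary reward from the hypothetical environment.
    reward = 1.0 if random.random() < TRUE_REWARD_PROB[action] else 0.0
    # Reward prediction error: the quantity that phasic dopamine responses
    # are hypothesised to report.
    delta = reward - Q[action]
    # Incremental update: cached values change only with repeated experience.
    Q[action] += ALPHA * delta

print(Q)  # learned values approach the underlying reward probabilities
```

Because the update is incremental, the cached values lag behind sudden changes in outcome value, which mirrors the distinction drawn above: a habit system adapts only through repeated experience, whereas a goal-directed system can revalue an action as soon as the value of its outcome changes.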