Lateral Intraparietal Cortex and Reinforcement Learning during a Mixed-Strategy Game
Open Access
- 3 June 2009
- journal article
- Published by Society for Neuroscience in Journal of Neuroscience
- Vol. 29 (22) , 7278-7289
- https://doi.org/10.1523/jneurosci.1479-09.2009
Abstract
Activity of the neurons in the lateral intraparietal cortex (LIP) displays a mixture of sensory, motor, and memory signals. Moreover, they often encode signals reflecting the accumulation of sensory evidence that certain eye movements might lead to a desirable outcome. However, when the environment changes dynamically, animals are also required to combine the information about its previously chosen actions and their outcomes appropriately to update continually the desirabilities of alternative actions. Here, we investigated whether LIP neurons encoded signals necessary to update an animal's decision-making strategies adaptively during a computer-simulated matching-pennies game. Using a reinforcement learning algorithm, we estimated the value functions that best predicted the animal's choices on a trial-by-trial basis. We found that, immediately before the animal revealed its choice, ∼18% of LIP neurons changed their activity according to the difference in the value functions for the two targets. In addition, a somewhat higher fraction of LIP neurons displayed signals related to the sum of the value functions, which might correspond to the state value function or an average rate of reward used as a reference point. Similar to the neurons in the prefrontal cortex, many LIP neurons also encoded the signals related to the animal's previous choices. Thus, the posterior parietal cortex might be a part of the network that provides the substrate for forming appropriate associations between actions and outcomes.Keywords
This publication has 58 references indexed in Scilit:
- Valuation of uncertain and delayed rewards in primate prefrontal cortexNeural Networks, 2009
- Cortical mechanisms for reinforcement learning in competitive gamesPhilosophical Transactions Of The Royal Society B-Biological Sciences, 2008
- Posterior Cingulate Cortex Mediates Outcome-Contingent Allocation of BehaviorNeuron, 2008
- Prefrontal Coding of Temporally Discounted Values during Intertemporal ChoiceNeuron, 2008
- Value Representations in the Primate Striatum during Matching BehaviorNeuron, 2008
- Game theory and neural basis of social decision makingNature Neuroscience, 2008
- Posterior Parietal Cortex Encodes Autonomously Selected Motor PlansNeuron, 2007
- Neural mechanism for stochastic behaviour during a competitive gameNeural Networks, 2006
- Cortical substrates for exploratory decisions in humansNature, 2006
- Neurons in the orbitofrontal cortex encode economic valueNature, 2006