Control of exploitation–exploration meta-parameter in reinforcement learning
- 1 June 2002
- journal article
- review article
- Published by Elsevier in Neural Networks
- Vol. 15 (4-6) , 665-687
- https://doi.org/10.1016/s0893-6080(02)00056-4
Abstract
No abstract availableKeywords
This publication has 51 references indexed in Scilit:
- Anterior Prefrontal Cortex Mediates Rule Learning in HumansCerebral Cortex, 2001
- Anterior Cingulate Cortex and Response Conflict: Effects of Frequency, Inhibition and ErrorsCerebral Cortex, 2001
- Online Model Selection Based on the Variational BayesNeural Computation, 2001
- Planning and acting in partially observable stochastic domainsPublished by Elsevier ,1998
- Anterior Cingulate Cortex, Error Detection, and the Online Monitoring of PerformanceScience, 1998
- A Neural Substrate of Prediction and RewardScience, 1997
- Novelty and Familiarity Activations in PET Studies of Memory Encoding and RetrievalCerebral Cortex, 1996
- A Neural System for Error Detection and CompensationPsychological Science, 1993
- Architecture and intrinsic connections of the prefrontal cortex in the rhesus monkeyJournal of Comparative Neurology, 1989
- Noradrenergic and serotoninergic innervation of cortical, thalamic, and tectal visual structures in old and new world monkeysJournal of Comparative Neurology, 1986