Learning Rates for Q-Learning
- 13 September 2001
- book chapter
- Published by Springer Nature
- p. 589-604
- https://doi.org/10.1007/3-540-44581-1_39
Abstract
No abstract availableKeywords
This publication has 4 references indexed in Scilit:
- The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement LearningSIAM Journal on Control and Optimization, 2000
- On the Convergence of Stochastic Iterative Dynamic Programming AlgorithmsNeural Computation, 1994
- Markov Decision ProcessesPublished by Wiley ,1994
- Technical Note: Q-LearningMachine Learning, 1992