Q-learning
- 1 May 1992
- journal article
- Published by Springer Nature in Machine Learning
- Vol. 8 (3-4) , 279-292
- https://doi.org/10.1007/bf00992698
Abstract
No abstract availableKeywords
This publication has 4 references indexed in Scilit:
- Self-improving reactive agents based on reinforcement learning, planning and teachingMachine Learning, 1992
- Learning control of finite Markov chains with an explicit trade-off between estimation and controlIEEE Transactions on Systems, Man, and Cybernetics, 1988
- Stochastic Approximation Methods for Constrained and Unconstrained SystemsPublished by Springer Nature ,1978
- Applied Dynamic ProgrammingPublished by Walter de Gruyter GmbH ,1962