Q-learning

Publisher Website

1 May 1992

journal article
Published by Springer Nature in Machine Learning

Vol. 8 (3-4) , 279-292
https://doi.org/10.1007/bf00992698

Abstract

No abstract available

Keywords

This publication has 4 references indexed in Scilit:

Self-improving reactive agents based on reinforcement learning, planning and teaching
Machine Learning, 1992
Learning control of finite Markov chains with an explicit trade-off between estimation and control
IEEE Transactions on Systems, Man, and Cybernetics, 1988
Stochastic Approximation Methods for Constrained and Unconstrained Systems
Published by Springer Nature ,1978
Applied Dynamic Programming
Published by Walter de Gruyter GmbH ,1962