Reinforcement learning with replacing eligibility traces

Publisher Website

1 January 1996

journal article
Published by Springer Nature in Machine Learning

Vol. 22 (1-3) , 123-158
https://doi.org/10.1007/bf00114726

Abstract

No abstract available

This publication has 12 references indexed in Scilit:

TD Models: Modeling the World at a Mixture of Time Scales
Published by Elsevier ,1995
On the Convergence of Stochastic Iterative Dynamic Programming Algorithms
Neural Computation, 1994
Improving Generalization for Temporal Difference Learning: The Successor Representation
Neural Computation, 1993
Temporal-difference methods and Markov models
IEEE Transactions on Systems, Man, and Cybernetics, 1993
Online Learning with Random Representations
Published by Elsevier ,1993
The Convergence of TD(λ) for General λ
Machine Learning, 1992
Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching
Machine Learning, 1992
Practical Issues in Temporal Difference Learning
Machine Learning, 1992
CMAC: an associative neural network alternative to backpropagation
Proceedings of the IEEE, 1990
Simulation and the Monte Carlo Method
Published by Wiley ,1981