Reinforcement learning with replacing eligibility traces
- 1 January 1996
- journal article
- Published by Springer Nature in Machine Learning
- Vol. 22 (1-3) , 123-158
- https://doi.org/10.1007/bf00114726
Abstract
No abstract available
This publication has 12 references indexed in Scilit:
- TD Models: Modeling the World at a Mixture of Time Scales. Published by Elsevier, 1995
- On the Convergence of Stochastic Iterative Dynamic Programming Algorithms. Neural Computation, 1994
- Improving Generalization for Temporal Difference Learning: The Successor Representation. Neural Computation, 1993
- Temporal-difference methods and Markov models. IEEE Transactions on Systems, Man, and Cybernetics, 1993
- Online Learning with Random Representations. Published by Elsevier, 1993
- The Convergence of TD(λ) for General λ. Machine Learning, 1992
- Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching. Machine Learning, 1992
- Practical Issues in Temporal Difference Learning. Machine Learning, 1992
- CMAC: an associative neural network alternative to backpropagation. Proceedings of the IEEE, 1990
- Simulation and the Monte Carlo Method. Published by Wiley, 1981