Residual Algorithms: Reinforcement Learning with Function Approximation

1 January 1995

book chapter
Published by Elsevier

pp. 30-37
https://doi.org/10.1016/b978-1-55860-377-6.50013-x

This publication has 6 references indexed in Scilit:

Technical Note: Q-Learning
Machine Learning, 1992
Practical Issues in Temporal Difference Learning
Machine Learning, 1992
Consistency of HDP applied to a simple reinforcement learning problem
Neural Networks, 1990
Neurogammon: a neural-network backgammon program
Published by Institute of Electrical and Electronics Engineers (IEEE), 1990
Learning to predict by the methods of temporal differences
Machine Learning, 1988
Learning representations by back-propagating errors
Nature, 1986