Residual Algorithms: Reinforcement Learning with Function Approximation
- 1 January 1995
- book chapter
- Published by Elsevier
Abstract
No abstract available
This publication has 6 references indexed in Scilit:
- Technical Note: Q-Learning. Machine Learning, 1992
- Practical Issues in Temporal Difference Learning. Machine Learning, 1992
- Consistency of HDP applied to a simple reinforcement learning problem. Neural Networks, 1990
- Neurogammon: a neural-network backgammon program. Published by Institute of Electrical and Electronics Engineers (IEEE), 1990
- Learning to predict by the methods of temporal differences. Machine Learning, 1988
- Learning representations by back-propagating errors. Nature, 1986