An Introduction to Reinforcement Learning Theory: Value Function Methods
- 30 January 2003
- book chapter
- Published by Springer Nature
- pp. 184-202
- https://doi.org/10.1007/3-540-36434-x_5
Abstract
No abstract available
This publication has 4 references indexed in Scilit:
- Infinite-Horizon Policy-Gradient Estimation. Journal of Artificial Intelligence Research, 2001
- An analysis of temporal-difference learning with function approximation. IEEE Transactions on Automatic Control, 1997
- TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play. Neural Computation, 1994
- Non-negative Matrices and Markov Chains. Published by Springer Nature, 1981