An Introduction to Reinforcement Learning Theory: Value Function Methods

30 January 2003

book chapter
Published by Springer Nature

pp. 184-202
https://doi.org/10.1007/3-540-36434-x_5

This publication has 4 references indexed in Scilit:

Infinite-Horizon Policy-Gradient Estimation. Journal of Artificial Intelligence Research, 2001.
An Analysis of Temporal-Difference Learning with Function Approximation. IEEE Transactions on Automatic Control, 1997.
TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play. Neural Computation, 1994.
Non-negative Matrices and Markov Chains. Springer Nature, 1981.