The Loss from Imperfect Value Functions in Expectation-Based and Minimax-Based Tasks

21 August 2007

book chapter
Published by Springer Nature

pp. 197-225
https://doi.org/10.1007/978-0-585-33656-5_9

This publication has 4 references indexed in Scilit:

The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces. Machine Learning, 1995.
Q-learning. Machine Learning, 1992.
Practical Issues in Temporal Difference Learning. Machine Learning, 1992.
Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics, 1983.