The Loss from Imperfect Value Functions in Expectation-Based and Minimax-Based Tasks
- 21 August 2007
- book chapter
- Published by Springer Nature
Abstract
No abstract available
This publication has 4 references indexed in Scilit:
- The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces. Machine Learning, 1995
- Q-learning. Machine Learning, 1992
- Practical Issues in Temporal Difference Learning. Machine Learning, 1992
- Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics, 1983