The convergence of TD(?) for general ?
- 1 May 1992
- journal article
- Published by Springer Nature in Machine Learning
- Vol. 8 (3-4) , 341-362
- https://doi.org/10.1007/bf00992701
Abstract
No abstract availableKeywords
This publication has 8 references indexed in Scilit:
- Consistency of HDP applied to a simple reinforcement learning problemNeural Networks, 1990
- Connectionistic Problem SolvingPublished by Springer Nature ,1990
- Neuronlike adaptive elements that can solve difficult learning control problemsIEEE Transactions on Systems, Man, and Cybernetics, 1983
- An adaptive optimal controller for discrete-time Markov environmentsInformation and Control, 1977
- Heat Transfer Augmentation in Laminar Fully Developed Channel Flow by Means of Heating From BelowJournal of Heat Transfer, 1975
- Some Studies in Machine Learning Using the Game of Checkers. II—Recent ProgressIBM Journal of Research and Development, 1967
- Applied Dynamic ProgrammingPublished by Walter de Gruyter GmbH ,1962
- Some Studies in Machine Learning Using the Game of CheckersIBM Journal of Research and Development, 1959