A Reinforcement Learning Method for Maximizing Undiscounted Rewards

1 January 1993

book chapter
Published by Elsevier

p. 298-305
https://doi.org/10.1016/b978-1-55860-307-3.50045-9

This publication has 6 references indexed in Scilit:

Q-learning
Machine Learning, 1992
Using Transitional Proximity for Faster Reinforcement Learning
Published by Elsevier, 1992
Transfer of Learning by Composing Solutions of Elemental Sequential Tasks
Machine Learning, 1992
Quasimorphisms or Queasymorphisms? Modeling Finite Automaton Environments
Published by Elsevier, 1991
Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming
Published by Elsevier, 1990
Discrete Dynamic Programming
The Annals of Mathematical Statistics, 1962