A Reinforcement Learning Method for Maximizing Undiscounted Rewards
- 1 January 1993
- book chapter
- Published by Elsevier
Abstract
No abstract availableThis publication has 6 references indexed in Scilit:
- Q-learningMachine Learning, 1992
- Using Transitional Proximity for Faster Reinforcement LearningPublished by Elsevier ,1992
- Transfer of Learning by Composing Solutions of Elemental Sequential TasksMachine Learning, 1992
- Quasimorphisms or Queasymorphisms? Modeling Finite Automaton EnvironmentsPublished by Elsevier ,1991
- Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic ProgrammingPublished by Elsevier ,1990
- Discrete Dynamic ProgrammingThe Annals of Mathematical Statistics, 1962