The actor-critic algorithm as multi-time-scale stochastic approximation

Open Access

Publisher Website

1 August 1997

journal article
Published by Springer Nature in Sādhanā

Vol. 22 (4) , 525-543
https://doi.org/10.1007/bf02745577

Abstract

No abstract available

Keywords

This publication has 10 references indexed in Scilit:

A reinforcement learning neural network for adaptive control of Markov chains
IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, 1997
A tutorial survey of reinforcement learning
Sādhanā, 1994
Markov Decision Processes
Published by Wiley ,1994
Nonconvergence to Unstable Points in Urn Models and Stochastic Approximations
The Annals of Probability, 1990
Adaptive Algorithms and Stochastic Approximations
Published by Springer Nature ,1990
Convergent activation dynamics in continuous time networks
Neural Networks, 1989
Estimation and control in discounted stochastic dynamic programming
Stochastics, 1987
Generalized polynomial approximations in Markovian decision processes
Journal of Mathematical Analysis and Applications, 1985
Stochastic Approximation Methods for Constrained and Unconstrained Systems
Published by Springer Nature ,1978
Chaotic relaxation
Linear Algebra and its Applications, 1969