Learning Algorithms for Markov Decision Processes with Average Cost

1 January 2001

journal article
Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Journal on Control and Optimization

Vol. 40 (3), 681-698
https://doi.org/10.1137/s0363012999361974

Abstract

No abstract available

This publication has 17 references indexed in Scilit:

A Learning Algorithm for Discrete-Time Stochastic Control
Probability in the Engineering and Informational Sciences, 2000
The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
SIAM Journal on Control and Optimization, 2000
Rollout Algorithms for Stochastic Scheduling Problems
Journal of Heuristics, 1999
Asynchronous Stochastic Approximations
SIAM Journal on Control and Optimization, 1998
A New Value Iteration Method for the Average Cost Dynamic Programming Problem
SIAM Journal on Control and Optimization, 1998
An analog scheme for fixed point computation. I. Theory
IEEE Transactions on Circuits and Systems I: Regular Papers, 1997
Stochastic approximation with two time scales
Systems & Control Letters, 1997
Recursive self-tuning control of finite Markov chains
Applicationes Mathematicae, 1997
Adaptive Algorithms and Stochastic Approximations
Published by Springer Nature, 1990
Distributed dynamic programming
IEEE Transactions on Automatic Control, 1982