Learning Algorithms for Markov Decision Processes with Average Cost
- 1 January 2001
- journal article
- Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Journal on Control and Optimization
- Vol. 40 (3) , 681-698
- https://doi.org/10.1137/s0363012999361974
Abstract
No abstract availableThis publication has 17 references indexed in Scilit:
- A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROLProbability in the Engineering and Informational Sciences, 2000
- The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement LearningSIAM Journal on Control and Optimization, 2000
- Rollout Algorithms for Stochastic Scheduling ProblemsJournal of Heuristics, 1999
- Asynchronous Stochastic ApproximationsSIAM Journal on Control and Optimization, 1998
- A New Value Iteration method for the Average Cost Dynamic Programming ProblemSIAM Journal on Control and Optimization, 1998
- An analog scheme for fixed point computation. I. TheoryIEEE Transactions on Circuits and Systems I: Regular Papers, 1997
- Stochastic approximation with two time scalesSystems & Control Letters, 1997
- Recursive self-tuning control of finite Markov chainsApplicationes Mathematicae, 1997
- Adaptive Algorithms and Stochastic ApproximationsPublished by Springer Nature ,1990
- Distributed dynamic programmingIEEE Transactions on Automatic Control, 1982