The method of value oriented successive approximations for the average reward Markov decision process

Publisher Website

1 December 1980

journal article
review article
Published by Springer Nature in OR Spectrum

Vol. 1 (4) , 233-242
https://doi.org/10.1007/bf01719500

Abstract

No abstract available

Keywords

This publication has 15 references indexed in Scilit:

Technical Note—Improved Conditions for Convergence in Undiscounted Markov Renewal Programming
Operations Research, 1977
Discounting, Ergodicity and Convergence for Markov Decision Processes
Management Science, 1977
Technical Note—The Method of Successive Approximations and Markovian Decision Problems
Operations Research, 1974
Optimal decision procedures for finite Markov chains. Part II: Communicating systems
Advances in Applied Probability, 1973
A Markov Decision Problem
Published by Elsevier ,1973
Some Bounds for Discounted Sequential Decision Processes
Management Science, 1971
Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive Approximations
Operations Research, 1971
Technical Note—Bounds on the Gain of a Markov Decision Process
Operations Research, 1971
On Finding the Maximal Gain for Markov Decision Processes
Operations Research, 1969
Weak ergodicity in non-homogeneous Markov chains
Mathematical Proceedings of the Cambridge Philosophical Society, 1958