The method of value oriented successive approximations for the average reward Markov decision process
- 1 December 1980
- journal article
- review article
- Published by Springer Nature in OR Spectrum
- Vol. 1 (4) , 233-242
- https://doi.org/10.1007/bf01719500
Abstract
No abstract availableKeywords
This publication has 15 references indexed in Scilit:
- Technical Note—Improved Conditions for Convergence in Undiscounted Markov Renewal ProgrammingOperations Research, 1977
- Discounting, Ergodicity and Convergence for Markov Decision ProcessesManagement Science, 1977
- Technical Note—The Method of Successive Approximations and Markovian Decision ProblemsOperations Research, 1974
- Optimal decision procedures for finite Markov chains. Part II: Communicating systemsAdvances in Applied Probability, 1973
- A Markov Decision ProblemPublished by Elsevier ,1973
- Some Bounds for Discounted Sequential Decision ProcessesManagement Science, 1971
- Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive ApproximationsOperations Research, 1971
- Technical Note—Bounds on the Gain of a Markov Decision ProcessOperations Research, 1971
- On Finding the Maximal Gain for Markov Decision ProcessesOperations Research, 1969
- Weak ergodicity in non-homogeneous Markov chainsMathematical Proceedings of the Cambridge Philosophical Society, 1958