A general markov decision method I: Model and techniques

1 June 1977

journal article
Published by Cambridge University Press (CUP) in Advances in Applied Probability

Vol. 9 (2) , 296-315
https://doi.org/10.2307/1426388

Abstract

This paper provides a new approach for solving a wide class of Markov decision problems including problems in which the space is general and the system can be continuously controlled. The optimality criterion is the long-run average cost per unit time. We decompose the decision processes into a common underlying stochastic process and a sequence of interventions so that the decision processes can be embedded upon a reduced set of states. Consequently, in the policy-iteration algorithm resulting from this approach the number of equations to be solved in any iteration step can be substantially reduced. Further, by its flexibility, this algorithm allows us to exploit any structure of the particular problem to be solved.

Keywords

This publication has 4 references indexed in Scilit:

A general markov decision method II: Applications
Advances in Applied Probability, 1977
A Solution to a Countable System of Equations Arising in Markovian Decision Processes
The Annals of Mathematical Statistics, 1967
Some limit theorems for a general Markov process
Probability Theory and Related Fields, 1966
Markov-Renewal Programming. I: Formulation, Finite Return Models
Operations Research, 1963