Markov decision chains with unbounded costs and applications to the control of queues
- 1 March 1976
- journal article
- Published by Cambridge University Press (CUP) in Advances in Applied Probability
- Vol. 8 (1) , 159-176
- https://doi.org/10.2307/1426027
Abstract
A discrete-time Markov decision model with a denumerable set of states and unbounded costs is considered. It is shown that the optimality equation of dynamic programming along with some additional, easily checked, conditions may be used to establish the optimality or ∊ -optimality of policies with respect to the average expected cost criterion. The results are used to derive optimal policies in two queueing examples.Keywords
This publication has 1 reference indexed in Scilit:
- Optimal decision procedures for finite markov chains. Part I: ExamplesAdvances in Applied Probability, 1973