Markov decision chains with unbounded costs and applications to the control of queues

1 March 1976

journal article
Published by Cambridge University Press (CUP) in Advances in Applied Probability

Vol. 8 (1), 159-176
https://doi.org/10.2307/1426027

Abstract

A discrete-time Markov decision model with a denumerable set of states and unbounded costs is considered. It is shown that the optimality equation of dynamic programming along with some additional, easily checked, conditions may be used to establish the optimality or ε-optimality of policies with respect to the average expected cost criterion. The results are used to derive optimal policies in two queueing examples.
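
For orientation, the optimality equation for the average expected cost criterion is commonly written in the form below. The notation is the usual textbook one and is an assumption here, not taken from the article itself: states i and j, admissible actions A(i), one-step cost c(i, a), transition probabilities p_{ij}(a), a constant g, and a function h on the state space.

```latex
g + h(i) \;=\; \min_{a \in A(i)} \Bigl\{ c(i,a) + \sum_{j} p_{ij}(a)\, h(j) \Bigr\},
\qquad i = 0, 1, 2, \dots
```

When a pair (g, h) satisfies this equation, g is a candidate for the minimal average cost, and a policy that picks a minimizing action in every state is a candidate for optimality. With unbounded costs the equation alone does not settle the question, which is presumably the role of the additional conditions mentioned in the abstract.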

Keywords

OPTIMALITY
UNBOUNDED COSTS
MODEL
MARKOV
EASILY
CRITERION
DENUMERABLE
DISCRETE
CHECKED
