Countable-State, Continuous-Time Dynamic Programming with Structure

1 June 1976

journal article
Published by Institute for Operations Research and the Management Sciences (INFORMS) in Operations Research

Vol. 24 (3) , 477-490
https://doi.org/10.1287/opre.24.3.477

Abstract

We consider the problem P of maximizing the expected discounted reward earned in a continuous-time Markov decision process with countable state and finite action space. (The reward rate is merely bounded by a polynomial.) By examining a sequence 〈p_N〉 of approximating problems, each of which is a semi-Markov decision process with exponential transition rate Λ_N, Λ_N ↗ ∞, we are able to show that P is obtained as the limit of the P_N. The value in representing P as the limit of P_N is that structural properties present in each P_N persist, in both the finite and the infinite horizon problem. Three queuing optimization models illustrating the method are given.

Keywords

This publication has 0 references indexed in Scilit: