Countable-State, Continuous-Time Dynamic Programming with Structure
- 1 June 1976
- journal article
- Published by Institute for Operations Research and the Management Sciences (INFORMS) in Operations Research
- Vol. 24 (3) , 477-490
- https://doi.org/10.1287/opre.24.3.477
Abstract
We consider the problem P of maximizing the expected discounted reward earned in a continuous-time Markov decision process with countable state and finite action space. (The reward rate is merely bounded by a polynomial.) By examining a sequence 〈PN〉 of approximating problems, each of which is a semi-Markov decision process with exponential transition rate ΛN, ΛN ↗ ∞, we are able to show that P is obtained as the limit of the PN. The value in representing P as the limit of PN is that structural properties present in each PN persist, in both the finite and the infinite horizon problem. Three queuing optimization models illustrating the method are given.
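To make the idea of approximating problems with a growing exponential transition rate concrete, here is a minimal numerical sketch. It is not the paper's construction: the model (admission control in an infinite-server queue with quadratic holding cost), the truncation of the state space at N, the choice ΛN = lam + N·mu, and all names and parameter values (`solve_truncated`, `lam`, `mu`, `alpha`, `R`, `h`) are assumptions made for illustration only. Each truncated problem is solved by value iteration on its uniformized, discrete-time equivalent.

```python
import numpy as np


def solve_truncated(N, lam=1.0, mu=1.0, alpha=0.1, R=3.0, h=0.5,
                    tol=1e-10, max_iter=200_000):
    """Value iteration for a hypothetical truncated problem P_N.

    States 0..N count customers in an infinite-server queue. Arrivals occur
    at rate lam and may be admitted (lump reward R) or rejected; each customer
    departs at rate mu; holding cost accrues at the polynomial rate h * i**2.
    All events are generated at the common rate Lam = lam + N * mu, with
    fictitious self-transitions filling the gap (uniformization).
    """
    states = np.arange(N + 1)
    Lam = lam + N * mu                    # assumed transition rate Lambda_N
    cost_rate = h * states**2
    serve = states * mu                   # total departure rate in state i
    idle = Lam - lam - serve              # fictitious self-transition rate
    v = np.zeros(N + 1)
    for _ in range(max_iter):
        v_up = np.append(v[1:], v[-1])        # v(i+1); boundary value at i = N
        v_down = np.insert(v[:-1], 0, v[0])   # v(i-1); unused at i = 0
        base = -cost_rate + serve * v_down + idle * v
        q_reject = (base + lam * v) / (alpha + Lam)
        q_admit = (base + lam * (R + v_up)) / (alpha + Lam)
        q_admit[-1] = q_reject[-1]            # no room to admit at state N
        v_new = np.maximum(q_admit, q_reject)
        converged = np.max(np.abs(v_new - v)) < tol
        v = v_new
        if converged:
            break
    policy = (q_admit > q_reject).astype(int)  # 1 = admit, 0 = reject
    return v, policy


if __name__ == "__main__":
    # Compare the value at the empty state and the policy as N grows.
    for N in (10, 20, 40):
        v, policy = solve_truncated(N)
        print(N, round(float(v[0]), 6), policy[:8])
```

In this toy model the value at the empty state stabilizes as N grows, and the computed policy in each truncated problem can be inspected for structure (for example, whether admission stops above some queue length). That mirrors, in a very small way, the abstract's point that properties established for each PN carry over to the limit.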