Average optimal policies in Markov decision drift processes with applications to a queueing and a replacement model
- 1 June 1983
- journal article
- Published by Cambridge University Press (CUP) in Advances in Applied Probability
- Vol. 15 (2) , 274-303
- https://doi.org/10.2307/1426437
Abstract
Recently the authors introduced the concept of Markov decision drift processes. A Markov decision drift process can be seen as a straightforward generalization of a Markov decision process with continuous time parameter. In this paper we investigate the existence of stationary average optimal policies for Markov decision drift processes. Using a well-known Abelian theorem we derive sufficient conditions, which guarantee that a ‘limit point' of a sequence of discounted optimal policies with the discounting factor approaching 1 is an average optimal policy. An alternative set of sufficient conditions is obtained for the case in which the discounted optimal policies generate regenerative stochastic processes. The latter set of conditions is easier to verify in several applications. The results of this paper are also applicable to Markov decision processes with discrete or continuous time parameter and to semi-Markov decision processes. In this sense they generalize some well-known results for Markov decision processes with finite or compact action space. Applications to an M/M/1 queueing model and a maintenance replacement model are given. It is shown that under certain conditions on the model parameters the average optimal policy for the M/M/1 queueing model is monotone non-decreasing (as a function of the number of waiting customers) with respect to the service intensity and monotone non-increasing with respect to the arrival intensity. For the maintenance replacement model we prove the average optimality of a bang-bang type policy. Special attention is paid to the computation of the optimal control parameters.Keywords
This publication has 26 references indexed in Scilit:
- Optimal control of random walks, birth and death processes, and queuesAdvances in Applied Probability, 1981
- Inventory control with two switch-over levels for a class of M/G/1 queueing systems with variable arrival and service rateStochastic Processes and their Applications, 1978
- Optimal replacement under additive damage and other failure modelsNaval Research Logistics Quarterly, 1975
- An Upper Bound for the Renewal FunctionThe Annals of Mathematical Statistics, 1972
- Optimal Control of a Service Facility with Variable Exponential Service Times and Constant Arrival RateManagement Science, 1972
- On the Nonexistence of $|varepsilon$-Optimal Randomized Stationary Policies in Average Cost Markov Decision ModelsThe Annals of Mathematical Statistics, 1971
- A Solution to a Countable System of Equations Arising in Markovian Decision ProcessesThe Annals of Mathematical Statistics, 1967
- Denumerable State Markovian Decision Processes-Average Cost CriterionThe Annals of Mathematical Statistics, 1966
- Markovian Sequential Replacement ProcessesThe Annals of Mathematical Statistics, 1965
- Discrete Dynamic ProgrammingThe Annals of Mathematical Statistics, 1962