Normalized Markov Decision Chains I; Sensitive Discount Optimality
- 1 August 1975
- journal article
- Published by Institute for Operations Research and the Management Sciences (INFORMS) in Operations Research
- Vol. 23 (4) , 785-795
- https://doi.org/10.1287/opre.23.4.785
Abstract
In this paper we study sensitive discount optimality criteria for finite state and action, discrete time parameter, stationary generalized Markov decision chains. We extend previous results obtained by Miller and Veinott and Veinott for substochastic transition matrices to arbitrary non-negative matrices with spectral radius not exceeding one. In particular, we generalize their policy improvement algorithm for finding a stationary policy maximizing the expected discounted reward for all sufficiently small positive interest rates.Keywords
This publication has 0 references indexed in Scilit: