Normalized Markov Decision Chains I; Sensitive Discount Optimality

1 August 1975

journal article
Published by Institute for Operations Research and the Management Sciences (INFORMS) in Operations Research

Vol. 23 (4) , 785-795
https://doi.org/10.1287/opre.23.4.785

Abstract

We study sensitive discount optimality criteria for stationary generalized Markov decision chains with finitely many states and actions and a discrete time parameter. We extend results obtained by Miller and Veinott, and by Veinott, for substochastic transition matrices to arbitrary non-negative matrices with spectral radius not exceeding one. In particular, we generalize their policy improvement algorithm for finding a stationary policy that maximizes the expected discounted reward for all sufficiently small positive interest rates.
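As a rough illustration of the setting, the following sketch runs ordinary policy improvement at one fixed interest rate rho > 0, with transition rows that are non-negative but need not sum to at most one. It is not the paper's algorithm, which concerns all sufficiently small positive interest rates at once. The data layout (`rewards[s][a]`, `trans[s][a]`), the function names, and the discounting convention beta = 1/(1 + rho) with rewards received at the start of each period are all assumptions made for this example.

```python
# Minimal sketch: policy improvement at ONE fixed interest rate rho > 0.
# Assumed (hypothetical) layout: rewards[s][a] is a float; trans[s][a] is a
# non-negative row over states. Rows need not sum to <= 1, but every policy's
# matrix is assumed to have spectral radius <= 1, so I - beta*P is invertible.

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda i: abs(M[i][c]))
        M[c], M[p] = M[p], M[c]
        for i in range(c + 1, n):
            f = M[i][c] / M[c][c]
            for j in range(c, n + 1):
                M[i][j] -= f * M[c][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x

def evaluate(policy, rewards, trans, beta):
    """Discounted value of a stationary policy: v = (I - beta*P)^{-1} r."""
    n = len(policy)
    A = [[(1.0 if i == j else 0.0) - beta * trans[i][policy[i]][j]
          for j in range(n)] for i in range(n)]
    b = [rewards[i][policy[i]] for i in range(n)]
    return solve(A, b)

def policy_improvement(rewards, trans, rho, tol=1e-12):
    """Return (policy, value) that no single-state action switch improves."""
    beta = 1.0 / (1.0 + rho)  # assumed discounting convention
    n = len(rewards)
    policy = [0] * n
    while True:
        v = evaluate(policy, rewards, trans, beta)
        changed = False
        for s in range(n):
            def q(a):
                # One-step lookahead value of taking action a in state s.
                return rewards[s][a] + beta * sum(
                    p * w for p, w in zip(trans[s][a], v))
            best = max(range(len(rewards[s])), key=q)
            if q(best) > q(policy[s]) + tol:
                policy[s] = best
                changed = True
        if not changed:
            return policy, v

# Example: action 1 in state 0 has row [0, 2], which is not substochastic,
# yet the resulting policy matrix [[0, 2], [0, 0]] has spectral radius 0.
rewards = [[1.5, 1.0], [1.0]]
trans = [[[0.5, 0.0], [0.0, 2.0]], [[0.0, 0.0]]]
policy, value = policy_improvement(rewards, trans, rho=0.25)
```

The improvement step relies on (I - beta*P)^{-1} being non-negative, which holds here because beta < 1 and the spectral radius of P does not exceed one.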

Keywords

OPTIMALITY
DISCOUNTED
VEINOTT
MILLER
SUFFICIENTLY
REWARD
DISCRETE
RADIUS
EXCEEDING
MAXIMIZING
