Technical Note—Improved Conditions for Convergence in Undiscounted Markov Renewal Programming

1 June 1977

journal article
Published by Institute for Operations Research and the Management Sciences (INFORMS) in Operations Research

Vol. 25 (3), 529-533
https://doi.org/10.1287/opre.25.3.529

Abstract

In a simply connected Markov renewal problem, each state is either transient under all policies or an element of a single chain under some policy. This property is easily verified; it implies invariance of the maximal long-term average return (gain) with respect to the initial state, which in turn assures convergence of Odoni's bounds in the damped value-iteration algorithm due to Schweitzer, even when the maximal-gain process is multiple-chained and/or periodic.
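The role of damping and of Odoni's bounds can be illustrated with a minimal sketch. It covers only the discrete-time special case in which every holding time equals 1, not the full Markov renewal setting of the note. The function name `damped_value_iteration`, the damping parameter `tau`, and the data layout `P[i][a][j]`, `r[i][a]` are assumptions made for this illustration and do not come from the paper. Each iteration replaces the transition law P by (1 - tau) I + tau P, which leaves the gain unchanged, and brackets the maximal gain between the smallest and largest component of the successive value differences.

```python
def damped_value_iteration(P, r, tau=0.5, tol=1e-9, max_iter=10_000):
    """Sketch of damped value iteration for an undiscounted MDP.

    Assumed layout (hypothetical, for illustration only):
      P[i][a][j] = probability of moving from state i to j under action a
      r[i][a]    = one-step reward in state i under action a
    tau in (0, 1] is the damping factor; tau = 1 means no damping.
    Returns (lower, upper, iterations), where lower <= maximal gain <= upper
    are the bounds taken from the last pair of successive iterates.
    """
    n = len(P)
    v = [0.0] * n
    lower, upper = float("-inf"), float("inf")
    for it in range(1, max_iter + 1):
        new_v = []
        for i in range(n):
            # Damped backup: stay put with weight (1 - tau), move with weight tau.
            best = max(
                r[i][a]
                + (1.0 - tau) * v[i]
                + tau * sum(P[i][a][j] * v[j] for j in range(n))
                for a in range(len(P[i]))
            )
            new_v.append(best)
        diffs = [nv - ov for nv, ov in zip(new_v, v)]
        # Bounds on the gain from the smallest and largest value increment.
        lower, upper = min(diffs), max(diffs)
        v = new_v
        if upper - lower < tol:
            break
    return lower, upper, it


# A periodic two-state chain with one action per state:
# state 0 -> state 1 with reward 1, state 1 -> state 0 with reward 3.
P = [[[0.0, 1.0]], [[1.0, 0.0]]]
r = [[1.0], [3.0]]

damped = damped_value_iteration(P, r, tau=0.5)
undamped = damped_value_iteration(P, r, tau=1.0, max_iter=50)
```

In this toy chain the long-run average reward is 2. Without damping the value differences alternate between (1, 3) and (3, 1), so the bounds never close; with `tau=0.5` the bounds coincide after two iterations.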

Keywords

CONVERGENCE
MARKOV
EASILY
ITERATION
TRANSIENT
SCHWEITZER
ODONI'S
ASSURES
RETURN
SIMPLY
