Technical Note—Improved Conditions for Convergence in Undiscounted Markov Renewal Programming
- 1 June 1977
- journal article
- Published by Institute for Operations Research and the Management Sciences (INFORMS) in Operations Research
- Vol. 25 (3) , 529-533
- https://doi.org/10.1287/opre.25.3.529
Abstract
In a simply connected Markov renewal problem, each state is either transient under all policies or an element of a single chain under some policy. This property is easily verified; it implies invariance of the maximal long-term average return (gain) with respect to the initial state, which in turn assures convergence of Odoni's bounds in the damped value-iteration algorithm due to Schweitzer, even when the maximal-gain process is multiple-chained and/or periodic.Keywords
This publication has 0 references indexed in Scilit: