Abstract
A test for nonoptimal actions in undiscounted Markov decision chains is proposed. The test eliminates actions for one or more stages after which they may re-enter the set of possibly optimal actions, but as convergence proceeds such re-entries cease.

This publication has 0 references indexed in Scilit: