Abstract
An optimal policy for a class of non-Markov decision processes is characterized by means of the theory of optimal control. This policy is found to take the form of an index rule: all the available actions are ranked by means of an index, and at all times the action with the smallest associated index is used.
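
The index rule described above can be illustrated with a minimal sketch. The abstract does not say how the index is computed, so the index is passed in as an arbitrary function, and the names `rank_actions`, `select_action`, and the numeric index values are hypothetical, chosen only for illustration.

```python
from typing import Callable, Hashable, Iterable, List


def rank_actions(actions: Iterable[Hashable],
                 index: Callable[[Hashable], float]) -> List[Hashable]:
    """Rank the available actions by ascending index value."""
    return sorted(actions, key=index)


def select_action(actions: Iterable[Hashable],
                  index: Callable[[Hashable], float]) -> Hashable:
    """Return the action with the smallest associated index.

    Raises ValueError if no actions are available.
    """
    return min(actions, key=index)


# Hypothetical index values at one decision epoch (not taken from the paper).
current_indices = {"a": 2.5, "b": 0.7, "c": 1.3}

ranking = rank_actions(current_indices, current_indices.get)
chosen = select_action(current_indices, current_indices.get)
```

In a sequential setting the index values would be recomputed from the current state of the process before each decision, and `select_action` would be called again with the updated values.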
