Characterization of Optimal Policies in Vector-Valued Markovian Decision Processes

1 May 1980

journal article
Published by Institute for Operations Research and the Management Sciences (INFORMS) in Mathematics of Operations Research

Vol. 5 (2) , 271-279
https://doi.org/10.1287/moor.5.2.271

Abstract

Infinite horizon Markovian decision processes with R^p-valued additive utilities are considered. The optimization criterion, here, is a pseudo-order preference relation induced by a convex cone in R^p. The state space is a countable set, and the action space is a compact metric space. Certain assumptions on the continuity of the reward vector and the transition probability are made. In this setting, an algorithm improving policies with respect to the chosen preference relation is given. A point-to-set mapping is defined, and optimal policies are characterized by fixed points of the mapping which are maximal in the set of all fixed points.

Keywords

This publication has 0 references indexed in Scilit: