Characterization of Optimal Policies in Vector-Valued Markovian Decision Processes
- 1 May 1980
- journal article
- Published by Institute for Operations Research and the Management Sciences (INFORMS) in Mathematics of Operations Research
- Vol. 5 (2) , 271-279
- https://doi.org/10.1287/moor.5.2.271
Abstract
Infinite horizon Markovian decision processes with Rp-valued additive utilities are considered. The optimization criterion, here, is a pseudo-order preference relation induced by a convex cone in Rp. The state space is a countable set, and the action space is a compact metric space. Certain assumptions on the continuity of the reward vector and the transition probability are made. In this setting, an algorithm improving policies with respect to the chosen preference relation is given. A point-to-set mapping is defined, and optimal policies are characterized by fixed points of the mapping which are maximal in the set of all fixed points.Keywords
This publication has 0 references indexed in Scilit: