Risk-Sensitive Markov Decision Processes

1 March 1972

journal article
Published by Institute for Operations Research and the Management Sciences (INFORMS) in Management Science

Vol. 18 (7) , 356-369
https://doi.org/10.1287/mnsc.18.7.356

Abstract

This paper considers the maximization of certain equivalent reward generated by a Markov decision process with constant risk sensitivity. First, value iteration is used to optimize possibly time-varying processes of finite duration. Then a policy iteration procedure is developed to find the stationary policy with highest certain equivalent gain for the infinite duration case. A simple example demonstrates both procedures.

Keywords

OPTIMIZE
DECISION PROCESS
MARKOV DECISION
HIGHEST
REWARD
POSSIBLY
INFINITE
MAXIMIZATION

This publication has 0 references indexed in Scilit: