Risk-Sensitive Markov Decision Processes
- 1 March 1972
- journal article
- Published by Institute for Operations Research and the Management Sciences (INFORMS) in Management Science
- Vol. 18 (7) , 356-369
- https://doi.org/10.1287/mnsc.18.7.356
Abstract
This paper considers the maximization of certain equivalent reward generated by a Markov decision process with constant risk sensitivity. First, value iteration is used to optimize possibly time-varying processes of finite duration. Then a policy iteration procedure is developed to find the stationary policy with highest certain equivalent gain for the infinite duration case. A simple example demonstrates both procedures.Keywords
This publication has 0 references indexed in Scilit: