Myopic Solutions of Markov Decision Processes and Stochastic Games

1 October 1981

journal article
Published by Institute for Operations Research and the Management Sciences (INFORMS) in Operations Research

Vol. 29 (5) , 995-1009
https://doi.org/10.1287/opre.29.5.995

Abstract

Sufficient conditions are presented for a Markov decision process to have a myopic optimum and for a stochastic game to possess a myopic equilibrium point. An optimum (or an equilibrium point) is said to be “myopic” if it can be deduced from an optimum (or an equilibrium point) of a static optimization problem (or a static [Nash] game). The principal conditions are (a) each single period reward is the sum of terms due to the current state and action, (b) each transition probability depends on the action taken but not on the state from which the transition occurs, and (c) an appropriate static optimum (or equilibrium point) is ad infinitum repeatable. These conditions are satisfied by several dynamic oligopoly models and numerous Markov decision processes.
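
As an illustration of conditions (a) and (b), the following minimal sketch builds a toy discounted Markov decision process and checks that the policy found by value iteration coincides with the maximizer of a one-period problem. Everything in it is an assumption made for illustration and is not taken from the article: the names K, L, P and BETA, the numbers, and the reading of condition (a) as a reward of the form K[a] + L[s]. Under that reading, the static objective K[a] + BETA * sum_j P[a][j] * L[j] follows from regrouping the discounted sum of rewards. Every action is feasible in every state here, so condition (c) holds trivially.

```python
# Toy discounted MDP; all numbers are made up for illustration.
BETA = 0.9                      # discount factor (assumed)
STATES = [0, 1, 2]
ACTIONS = [0, 1]
K = [1.0, 0.5]                  # reward term due to the action, K[a]
L = [0.0, 2.0, -1.0]            # reward term due to the state, L[s]
# P[a][j]: probability of moving to state j. It depends on the action
# only, not on the current state (condition (b)).
P = [[0.6, 0.1, 0.3],
     [0.1, 0.8, 0.1]]


def static_objective(a):
    """One-period objective: action reward plus discounted expected
    state reward of the next state."""
    return K[a] + BETA * sum(P[a][j] * L[j] for j in STATES)


def myopic_action():
    """Action that maximizes the static (one-period) objective."""
    return max(ACTIONS, key=static_objective)


def value_iteration(tol=1e-10):
    """Standard value iteration on the full dynamic problem.
    Returns the value function and a greedy policy (one action per state)."""
    v = [0.0 for _ in STATES]
    while True:
        q = [[K[a] + L[s] + BETA * sum(P[a][j] * v[j] for j in STATES)
              for a in ACTIONS]
             for s in STATES]
        new_v = [max(row) for row in q]
        if max(abs(x - y) for x, y in zip(new_v, v)) < tol:
            policy = [max(ACTIONS, key=lambda a, s=s: q[s][a])
                      for s in STATES]
            return new_v, policy
        v = new_v


if __name__ == "__main__":
    values, policy = value_iteration()
    print("myopic action:", myopic_action())
    print("dynamic policy:", policy)
```

In this toy instance the dynamic policy selects the myopic action in every state, because the continuation value differs across states only through the additive term L[s], which no action can influence.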
