Evaluating Strategies for Markov Decision Processes in Parallel
- 1 February 1990
- journal article
- Published by Institute for Operations Research and the Management Sciences (INFORMS) in Mathematics of Operations Research
- Vol. 15 (1) , 17-32
- https://doi.org/10.1287/moor.15.1.17
Abstract
The authors' work on the use of Gittins indices in the evaluation of strategies for families of alternative bandit processes has found many applications. Among these are procedures for sensitivity analysis in stochastic scheduling. This theoretical paper aims at developing results which will form the basis of an approach to strategy evaluation for a class of processes of greater complexity. These are Markov Decision Processes in parallel satisfying a condition first enunciated by Whittle. The theory of Gittins indices forms the basis of the analysis.Keywords
This publication has 0 references indexed in Scilit: