On the evaluation of suboptimal strategies for families of alternative bandit processes
- 1 September 1982
- journal article
- Published by Cambridge University Press (CUP) in Journal of Applied Probability
- Vol. 19 (3) , 716-722
- https://doi.org/10.2307/3213534
Abstract
Families of alternative bandit processes have been used as models for problems in a variety of areas. Optimal strategies for these decision processes are determined by dynamic allocation indices. These indices are here shown to play an important role in the evaluation of suboptimal strategies.Keywords
This publication has 9 references indexed in Scilit:
- Some best possible results for a discounted one armed banditMetrika, 1983
- Randomized Allocation of Treatments in Sequential ExperimentsJournal of the Royal Statistical Society Series B: Statistical Methodology, 1981
- On Randomized Dynamic Allocation Indices for the Sequential Design of ExperimentsJournal of the Royal Statistical Society Series B: Statistical Methodology, 1980
- Multi-Armed Bandits and the Gittins IndexJournal of the Royal Statistical Society Series B: Statistical Methodology, 1980
- Der diskontierte Einarmige BanditMetrika, 1979
- Bandit Processes and Dynamic Allocation IndicesJournal of the Royal Statistical Society Series B: Statistical Methodology, 1979
- On the optimal allocation of two or more treatments in a controlled clinical trialBiometrika, 1978
- On Bayesian models in stochastic schedulingJournal of Applied Probability, 1977
- Stochastic scheduling with order constraintsInternational Journal of Systems Science, 1976