Optimal strategies for families of alternative bandit processes

1 August 1983

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Automatic Control

Vol. 28 (8) , 858-861
https://doi.org/10.1109/tac.1983.1103327

Abstract

Many stochastic resource allocation problems may be formulated as families of alternative bandit processes. One example is the classical one-armed bandit problem recently studied by Kumar and Seidman. Optimal strategies for such problems are known to be determined by a collection of dynamic allocation indexes (DAI's). The aim of this note is to bring this important result to the attention of control theorists and to give a new proof of it. Applications and some related work are also discussed.

Keywords

This publication has 17 references indexed in Scilit:

Some best possible results for a discounted one armed bandit
Metrika, 1983
On the evaluation of suboptimal strategies for families of alternative bandit processes
Journal of Applied Probability, 1982
On a sufficient condition for superprocesses due to whittle
Journal of Applied Probability, 1982
On the optimal solution of the one-armed bandit adaptive control problem
IEEE Transactions on Automatic Control, 1981
Der diskontierte Einarmige Bandit
Metrika, 1979
On the optimal allocation of two or more treatments in a controlled clinical trial
Biometrika, 1978
Multiple feedback at a single-server station
Stochastic Processes and their Applications, 1977
Scheduling, Queues and Dynamic Allocation Indices
Published by Springer Nature ,1977
A profitability index for alternative research projects
Omega, 1976
Stochastic scheduling with order constraints
International Journal of Systems Science, 1976