Optimal strategies for families of alternative bandit processes
- 1 August 1983
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Automatic Control
- Vol. 28 (8) , 858-861
- https://doi.org/10.1109/tac.1983.1103327
Abstract
Many stochastic resource allocation problems may be formulated as families of alternative bandit processes. One example is the classical one-armed bandit problem recently studied by Kumar and Seidman. Optimal strategies for such problems are known to be determined by a collection of dynamic allocation indexes (DAI's). The aim of this note is to bring this important result to the attention of control theorists and to give a new proof of it. Applications and some related work are also discussed.Keywords
This publication has 17 references indexed in Scilit:
- Some best possible results for a discounted one armed banditMetrika, 1983
- On the evaluation of suboptimal strategies for families of alternative bandit processesJournal of Applied Probability, 1982
- On a sufficient condition for superprocesses due to whittleJournal of Applied Probability, 1982
- On the optimal solution of the one-armed bandit adaptive control problemIEEE Transactions on Automatic Control, 1981
- Der diskontierte Einarmige BanditMetrika, 1979
- On the optimal allocation of two or more treatments in a controlled clinical trialBiometrika, 1978
- Multiple feedback at a single-server stationStochastic Processes and their Applications, 1977
- Scheduling, Queues and Dynamic Allocation IndicesPublished by Springer Nature ,1977
- A profitability index for alternative research projectsOmega, 1976
- Stochastic scheduling with order constraintsInternational Journal of Systems Science, 1976