Branching Bandit Processes

27 July 1988

journal article
research article
Published by Cambridge University Press (CUP) in Probability in the Engineering and Informational Sciences

Vol. 2 (3) , 269-278
https://doi.org/10.1017/s0269964800000826

Abstract

A set of n_iarms of typei, i= 1,…, L, is available. A pull of arm of typeioccupies a duration V_iat the end of which a reward C_iand N_i1,…, N_iLnew arms are obtained, while all other arms are frozen. A Gittins priority order of types is obtained and shown to yield the maximal discounted reward from this branching process of arms.

Keywords

This publication has 17 references indexed in Scilit:

The Multi-Armed Bandit Problem: Decomposition and Computation
Mathematics of Operations Research, 1987
A Note on M. N. Katehakis' and Y.-R. Chen's Computation of the Gittins Index
Mathematics of Operations Research, 1986
Linear Programming for Finite State Multi-Armed Bandit Problems
Mathematics of Operations Research, 1986
Multi-Armed Bandits with Discount Factor Near One: The Bernoulli Case
The Annals of Statistics, 1981
Time-Sharing Service Systems. II
Theory of Probability and Its Applications, 1979
Bandit Processes and Dynamic Allocation Indices
Journal of the Royal Statistical Society Series B: Statistical Methodology, 1979
A hamiltonian approach to optimal stochastic resource allocation
Advances in Applied Probability, 1977
Stochastic scheduling with order constraints
International Journal of Systems Science, 1976
Dynamic Scheduling of a Multiclass Queue: Discount Optimality
Operations Research, 1975
Scheduling for Minimum Total Loss Using Service Time Distributions
Journal of the ACM, 1974