Linear Programming for Finite State Multi-Armed Bandit Problems

Abstract
We consider the multi-armed bandit problem. We show that when the state space is finite the computation of the dynamic allocation indices can be handled by linear programming methods.

This publication has 0 references indexed in Scilit: