Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem
- 1 January 2003
- book chapter
- Published by Springer Nature
- Vol. 2777, 418-432
- https://doi.org/10.1007/978-3-540-45167-9_31
Abstract
No abstract availableThis publication has 8 references indexed in Scilit:
- PAC Bounds for Multi-armed Bandit and Markov Decision ProcessesPublished by Springer Nature ,2002
- The Nonstochastic Multiarmed Bandit ProblemSIAM Journal on Computing, 2002
- Finite-time lower bounds for the two-armed bandit problemIEEE Transactions on Automatic Control, 2000
- Neural Network LearningPublished by Cambridge University Press (CUP) ,1999
- Asymptotically efficient adaptive allocation rulesAdvances in Applied Mathematics, 1985
- Bandit problemsPublished by Springer Nature ,1985
- ASYMPTOTICALLY OPTIMAL PROCEDURES FOR SEQUENTIAL ADAPTIVE SELECTION OF THE BEST OF SEVERAL NORMAL MEANSPublished by Elsevier ,1982
- Some aspects of the sequential design of experimentsBulletin of the American Mathematical Society, 1952