Active Exploration and Learning in Real-Valued Spaces using Multi-Armed Bandit Allocation Indices

Abstract
No abstract available
