SOTAVerified

Breaking the (1/Δ_2) Barrier: Better Batched Best Arm Identification with Adaptive Grids

2025-01-29Unverified0· sign in to hype

Tianyuan Jin, Qin Zhang, Dongruo Zhou

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We investigate the problem of batched best arm identification in multi-armed bandits, where we aim to identify the best arm from a set of n arms while minimizing both the number of samples and batches. We introduce an algorithm that achieves near-optimal sample complexity and features an instance-sensitive batch complexity, which breaks the (1/_2) barrier. The main contribution of our algorithm is a novel sample allocation scheme that effectively balances exploration and exploitation for batch sizes. Experimental results indicate that our approach is more batch-efficient across various setups. We also extend this framework to the problem of batched best arm identification in linear bandits and achieve similar improvements.

Tasks

Reproductions