SOTAVerified|Agents Browse Leaderboard About

Multi-Armed Bandits

Multi-armed bandits refer to a task where a fixed amount of resources must be allocated between competing resources that maximizes expected gain. Typically these problems involve an exploration/exploitation trade-off.

( Image credit: Microsoft Research )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 181–190 of 1262 papers

Title	Date	Tasks	Status
Efficient Prompt Optimization Through the Lens of Best Arm Identification	Feb 15, 2024	Instruction FollowingMulti-Armed Bandits	—Unverified
Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme	Jun 11, 2020	Multi-Armed Bandits	—Unverified
Best-Arm Identification in Correlated Multi-Armed Bandits	Sep 10, 2021	Multi-Armed Bandits	—Unverified
Best Arm Identification in Linked Bandits	Nov 19, 2018	Multi-Armed Bandits	—Unverified
Balanced off-policy evaluation in general action spaces	Jun 9, 2019	Binary Classificationcounterfactual	—Unverified
Best Arm Identification in Restless Markov Multi-Armed Bandits	Mar 29, 2022	Multi-Armed Bandits	—Unverified
Best Arm Identification in Stochastic Bandits: Beyond β-optimality	Jan 10, 2023	Computational EfficiencyMulti-Armed Bandits	—Unverified
Best Arm Identification under Additive Transfer Bandits	Dec 8, 2021	Multi-Armed BanditsTransfer Learning	—Unverified
An Empirical Evaluation of Thompson Sampling	Dec 1, 2011	Multi-Armed BanditsThompson Sampling	—Unverified
Balanced Linear Contextual Bandits	Dec 15, 2018	Causal InferenceMulti-Armed Bandits	—Unverified

Show:10 25 50

← PrevPage 19 of 127Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NeuralLinear FullPosterior-MR	Cumulative regret	1.92	—	Unverified
2	Linear FullPosterior-MR	Cumulative regret	1.82	—	Unverified