SOTAVerified|Agents Browse Leaderboard About Blog

Multi-Armed Bandits

Multi-armed bandits refer to a task where a fixed amount of resources must be allocated between competing resources that maximizes expected gain. Typically these problems involve an exploration/exploitation trade-off.

( Image credit: Microsoft Research )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1231–1240 of 1262 papers

Title	Date	Tasks	Status
Human in the Loop Adaptive Optimization for Improved Time Series Forecasting	May 21, 2025	Language ModelingLanguage Modelling	CodeCode Available
Adversarial Attacks on Combinatorial Multi-Armed Bandits	Oct 8, 2023	Multi-Armed Bandits	CodeCode Available
Machine Teaching of Active Sequential Learners	Sep 8, 2018	Multi-Armed BanditsProbabilistic Programming	CodeCode Available
Doubly-Robust Lasso Bandit	Jul 26, 2019	Multi-Armed BanditsRecommendation Systems	CodeCode Available
A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit	Oct 2, 2015	Decision MakingMulti-Armed Bandits	CodeCode Available
Thompson Sampling via Local Uncertainty	Oct 30, 2019	Decision MakingMulti-Armed Bandits	CodeCode Available
Identification of the Generalized Condorcet Winner in Multi-dueling Bandits	Dec 1, 2021	Multi-Armed Bandits	CodeCode Available
SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits	Sep 21, 2018	Multi-Armed Bandits	CodeCode Available
Doubly Robust Policy Evaluation and Learning	Mar 23, 2011	Decision MakingMulti-Armed Bandits	CodeCode Available
Dual-Mandate Patrols: Multi-Armed Bandits for Green Security	Sep 14, 2020	Multi-Armed Bandits	CodeCode Available

Show:10 25 50

← PrevPage 124 of 127Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NeuralLinear FullPosterior-MR	Cumulative regret	1.92	—	Unverified
2	Linear FullPosterior-MR	Cumulative regret	1.82	—	Unverified