SOTAVerified

Multi-Armed Bandits

Multi-armed bandits refer to the problem of allocating a fixed amount of resources among competing alternatives so as to maximize expected gain, when each alternative's reward distribution is only partially known. These problems typically involve an exploration/exploitation trade-off: the learner must balance trying under-sampled arms against repeatedly playing the arm that currently looks best.

(Image credit: Microsoft Research)
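The exploration/exploitation trade-off described above can be illustrated with a minimal epsilon-greedy simulation. This is a generic sketch, not code from any of the papers listed below: the function name, the Bernoulli reward model, and all parameter values are illustrative assumptions.

```python
import random

def epsilon_greedy_bandit(true_means, steps=10000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy agent on a Bernoulli bandit.

    `true_means` holds the hidden success probability of each arm.
    With probability `epsilon` the agent explores (random arm);
    otherwise it exploits (arm with the best empirical mean so far).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms    # number of pulls per arm
    values = [0.0] * n_arms  # empirical mean reward per arm
    total_reward = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore
        else:
            arm = max(range(n_arms), key=values.__getitem__)  # exploit
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # incremental update of the running empirical mean
        values[arm] += (reward - values[arm]) / counts[arm]
        total_reward += reward
    return total_reward, counts
```

With enough steps, most pulls concentrate on the highest-mean arm while the epsilon fraction keeps sampling the others, which is the trade-off the listed papers study under many variants (contextual, risk-aware, fair, networked, etc.).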

Papers

Showing 191–200 of 1262 papers

Title | Status | Hype
Empirical analysis of representation learning and exploration in neural kernel bandits | Code | 0
Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness | Code | 0
Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards | Code | 0
Multi-Armed Bandits in Brain-Computer Interfaces | Code | 0
Bandit-Based Monte Carlo Optimization for Nearest Neighbors | Code | 0
Multi-Armed Bandits with Network Interference | Code | 0
An Experimental Design for Anytime-Valid Causal Inference on Multi-Armed Bandits | Code | 0
Myopic Bayesian Design of Experiments via Posterior Sampling and Probabilistic Programming | Code | 0
Model selection for contextual bandits | Code | 0
Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback | Code | 0
Page 20 of 127

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | | Unverified
2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | | Unverified