
Multi-Armed Bandits

Multi-armed bandits refer to a class of problems in which a fixed, limited amount of resources must be allocated among competing choices in a way that maximizes expected gain, when each choice's payoff is only partially known at the time of allocation. These problems typically involve an exploration/exploitation trade-off: spending pulls to learn more about uncertain arms versus pulling the arm that currently looks best.

(Image credit: Microsoft Research)
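To make the exploration/exploitation trade-off concrete, here is a minimal epsilon-greedy sketch for a Bernoulli bandit. It is an illustrative baseline only, not the method of any paper listed below; the arm probabilities, round count, and epsilon value are assumptions chosen for the example.

```python
import numpy as np

def epsilon_greedy_bandit(true_means, n_rounds=10_000, epsilon=0.1, seed=0):
    """Play a Bernoulli bandit with epsilon-greedy action selection.

    true_means are the (hidden) payout probabilities of each arm;
    the agent only observes sampled rewards.
    """
    rng = np.random.default_rng(seed)
    n_arms = len(true_means)
    counts = np.zeros(n_arms)        # pulls per arm
    estimates = np.zeros(n_arms)     # running mean reward per arm
    total_reward = 0.0
    for _ in range(n_rounds):
        if rng.random() < epsilon:           # explore: pick a random arm
            arm = int(rng.integers(n_arms))
        else:                                # exploit: pick the best estimate
            arm = int(np.argmax(estimates))
        reward = float(rng.random() < true_means[arm])  # Bernoulli draw
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward
    return estimates, total_reward

# Example with three arms of unknown quality (values are illustrative).
estimates, reward = epsilon_greedy_bandit([0.2, 0.5, 0.7])
print(estimates, reward)
```

With epsilon = 0.1, roughly 10% of pulls are spent exploring regardless of how confident the estimates are; more refined strategies (UCB, Thompson sampling) instead scale exploration to the remaining uncertainty about each arm.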

Papers

Showing 1181–1190 of 1262 papers

Title | Status | Hype
Decentralized Cooperative Stochastic Bandits | Code | 0
Gaussian Gated Linear Networks | Code | 0
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions | Code | 0
(Almost) Free Incentivized Exploration from Decentralized Learning Agents | Code | 0
Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery | Code | 0
MABSplit: Faster Forest Training Using Multi-Armed Bandits | Code | 0
Risk-Aware Continuous Control with Neural Contextual Bandits | Code | 0
Thompson Sampling for Linearly Constrained Bandits | Code | 0
Bayesian Optimisation over Multiple Continuous and Categorical Inputs | Code | 0
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | Code | 0
Page 119 of 127

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | – | Unverified
2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | – | Unverified
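Both benchmark entries are Thompson-sampling variants evaluated on cumulative regret. As a point of reference for the metric, below is a minimal Bernoulli Thompson-sampling sketch that tracks cumulative pseudo-regret. It is not the benchmarked NeuralLinear or Linear posterior models, and the arm means and round count are illustrative assumptions.

```python
import numpy as np

def thompson_sampling_regret(true_means, n_rounds=10_000, seed=0):
    """Bernoulli Thompson sampling; returns cumulative pseudo-regret per round."""
    rng = np.random.default_rng(seed)
    n_arms = len(true_means)
    alpha = np.ones(n_arms)   # Beta posterior: 1 + observed successes
    beta = np.ones(n_arms)    # Beta posterior: 1 + observed failures
    best_mean = max(true_means)
    regret = np.zeros(n_rounds)
    for t in range(n_rounds):
        samples = rng.beta(alpha, beta)   # one posterior sample per arm
        arm = int(np.argmax(samples))     # play the arm whose sample is best
        reward = float(rng.random() < true_means[arm])
        alpha[arm] += reward
        beta[arm] += 1.0 - reward
        # Pseudo-regret: gap between the best arm's mean and the chosen arm's.
        regret[t] = best_mean - true_means[arm]
    return np.cumsum(regret)

cum_regret = thompson_sampling_regret([0.2, 0.5, 0.7])
print(cum_regret[-1])  # total regret accumulated over all rounds
```

Cumulative regret is the standard yardstick here: lower is better, and a well-tuned posterior-sampling method should accumulate regret sublinearly as its posterior concentrates on the best arm.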