Multi-Armed Bandits

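The multi-armed bandit problem is a task in which a fixed, limited set of resources must be allocated among competing choices so as to maximize expected gain. These problems typically involve an exploration/exploitation trade-off: the learner must balance trying options whose payoff is still uncertain against repeatedly choosing the option that currently looks best.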

(Image credit: Microsoft Research)
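
The exploration/exploitation trade-off in the task description can be illustrated with a minimal sketch of epsilon-greedy, one common baseline strategy. Everything here is an assumption made for illustration and is not taken from any paper listed on this page: the function name `epsilon_greedy`, the Bernoulli reward model, the arm means `[0.2, 0.5, 0.8]`, and the default `horizon`, `epsilon`, and `seed` values.

```python
import random


def epsilon_greedy(true_means, horizon=5000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on Bernoulli arms.

    Returns (pull counts per arm, cumulative expected regret).
    `true_means` are hypothetical success probabilities, hidden from the learner.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean of observed rewards per arm
    best_mean = max(true_means)
    regret = 0.0

    for _ in range(horizon):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimated mean.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Draw a Bernoulli reward from the chosen arm.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running mean for that arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

        # Expected regret: gap between the best arm and the chosen arm.
        regret += best_mean - true_means[arm]

    return counts, regret


counts, regret = epsilon_greedy([0.2, 0.5, 0.8])
print("pulls per arm:", counts)
print("cumulative regret:", round(regret, 2))
```

With a constant `epsilon`, the learner keeps exploring at a fixed rate, so its cumulative regret grows linearly in the horizon. Many of the papers listed here study strategies that aim for slower regret growth.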

Papers

Showing 781–790 of 1262 papers

Title	Date	Tasks	Status
Linear Contextual Bandits with Interference	Sep 24, 2024	Causal Inference, Decision Making	Unverified
Linear Contextual Bandits with Knapsacks	Jul 24, 2015	Multi-Armed Bandits	Unverified
Lipschitz Bandits: Regret Lower Bounds and Optimal Algorithms	May 19, 2014	Multi-Armed Bandits	Unverified
LLMs-augmented Contextual Bandit	Nov 3, 2023	Multi-Armed Bandits, reinforcement-learning	Unverified
Local Clustering in Contextual Multi-Armed Bandits	Feb 26, 2021	Clustering, Multi-Armed Bandits	Unverified
Local Differential Privacy for Sequential Decision Making in a Changing Environment	Jan 2, 2023	Decision Making, Multi-Armed Bandits	Unverified
(Locally) Differentially Private Combinatorial Semi-Bandits	Jun 1, 2020	Multi-Armed Bandits, Privacy Preserving	Unverified
Make the Minority Great Again: First-Order Regret Bound for Contextual Bandits	Feb 9, 2018	Multi-Armed Bandits	Unverified
Making Contextual Decisions with Low Technical Debt	Jun 13, 2016	Multi-Armed Bandits	Unverified
Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery	May 3, 2024	Decision Making, Interpretable Machine Learning	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NeuralLinear FullPosterior-MR	Cumulative regret	1.92	—	Unverified
2	Linear FullPosterior-MR	Cumulative regret	1.82	—	Unverified