
Multi-Armed Bandits

Multi-armed bandits refer to a class of sequential decision problems in which a fixed, limited amount of resources must be allocated among competing alternatives (arms) so as to maximize expected gain. These problems typically involve an exploration/exploitation trade-off: the learner must balance trying arms to learn their payoffs against repeatedly playing the arm that currently looks best.

(Image credit: Microsoft Research)
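As a concrete illustration of the exploration/exploitation trade-off, here is a minimal ε-greedy simulation on a Bernoulli bandit. This is a generic sketch, not the method of any paper listed on this page; the arm means, epsilon, horizon, and the run_epsilon_greedy name are all illustrative choices.

```python
import random

def run_epsilon_greedy(true_means, epsilon=0.1, horizon=1000, seed=0):
    """Simulate an epsilon-greedy agent on a Bernoulli bandit.

    true_means: reward probability of each arm (unknown to the agent).
    epsilon: probability of exploring a random arm instead of exploiting.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms          # number of pulls per arm
    estimates = [0.0] * n_arms     # running mean reward per arm
    total_reward = 0.0

    for _ in range(horizon):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)
        else:
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Draw a Bernoulli reward from the chosen arm.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        total_reward += reward

        # Incremental update of the chosen arm's mean estimate.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    # Regret: expected reward of always playing the best arm, minus what we got.
    regret = horizon * max(true_means) - total_reward
    return estimates, total_reward, regret

if __name__ == "__main__":
    est, reward, regret = run_epsilon_greedy([0.2, 0.5, 0.7])
    print("estimates:", [round(e, 2) for e in est])
    print("total reward:", reward, "regret:", round(regret, 1))
```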

Papers

Showing 151–160 of 1,262 papers

Title | Status | Hype
Bandit Regret Scaling with the Effective Loss Range | | 0
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits | | 0
Bandits for Learning to Explain from Explanations | | 0
Bandits meet Computer Architecture: Designing a Smartly-allocated Cache | | 0
Bandit Social Learning: Exploration under Myopic Behavior | | 0
Bandits Warm-up Cold Recommender Systems | | 0
Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms | | 0
Bandits with Knapsacks beyond the Worst Case | | 0
A Gang of Bandits | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | | Unverified
2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | | Unverified
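For reference, the cumulative regret reported above is conventionally defined (this is the standard textbook definition, not a detail taken from these benchmark entries) as the gap between always playing the best arm and the rewards actually collected over a horizon of T rounds:

```latex
R_T \;=\; T\,\mu^{*} \;-\; \mathbb{E}\Big[\sum_{t=1}^{T} r_t\Big],
\qquad \mu^{*} \;=\; \max_{a} \mu_a ,
```

where \mu_a is the mean reward of arm a and r_t is the reward received at round t; lower values are better.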