
Multi-Armed Bandits

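The multi-armed bandit problem is a task in which a fixed, limited amount of resources must be allocated among competing choices (the "arms") so as to maximize expected gain. Because each arm's payoff is only partially known when the allocation is made, these problems typically involve an exploration/exploitation trade-off: spend resources learning about uncertain arms, or commit to the arm that currently looks best.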
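The exploration/exploitation trade-off can be illustrated with a minimal sketch of epsilon-greedy, one common baseline strategy. It is not taken from any paper listed on this page. The function name `epsilon_greedy`, the Bernoulli reward model, and the values of `true_means`, `steps`, `epsilon`, and `seed` are all assumptions chosen for illustration.

```python
import random


def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on hypothetical Bernoulli arms.

    Returns (pull counts, estimated means, cumulative expected regret).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    best_mean = max(true_means)  # known only to the simulator, not the learner
    regret = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimate so far.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Draw a 0/1 reward from the chosen arm (assumed Bernoulli).
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running mean for that arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

        # Expected regret: gap between the best arm and the chosen arm.
        regret += best_mean - true_means[arm]

    return counts, estimates, regret


counts, estimates, regret = epsilon_greedy([0.2, 0.5, 0.7])
print(counts, [round(e, 3) for e in estimates], round(regret, 1))
```

A larger `epsilon` spends more pulls on exploration and keeps estimates for every arm fresh, while a smaller value exploits the current best estimate more often at the risk of locking onto a suboptimal arm. The returned regret is the kind of quantity a cumulative regret metric tracks, although benchmarks may normalize it differently.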

( Image credit: Microsoft Research )

Papers


Showing 931–940 of 1262 papers

Title	Date	Tasks	Status	Hype
Efficient and Robust Algorithms for Adversarial Linear Contextual Bandits	Feb 1, 2020	Multi-Armed Bandits	Unverified	0
Bandits with Knapsacks beyond the Worst-Case	Feb 1, 2020	Multi-Armed Bandits	Unverified	0
Ballooning Multi-Armed Bandits	Jan 24, 2020	Multi-Armed Bandits	Unverified	0
Incentivising Exploration and Recommendations for Contextual Bandits with Payments	Jan 22, 2020	Multi-Armed Bandits	Unverified	0
Exploration Through Bias: Revisiting Biased Maximum Likelihood Estimation in Stochastic Multi-Armed Bandits	Jan 1, 2020	Multi-Armed Bandits	Unverified	0
Gradient-free Online Learning in Continuous Games with Delayed Rewards	Jan 1, 2020	Multi-Armed Bandits, Recommendation Systems	Unverified	0
Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits	Jan 1, 2020	Multi-Armed Bandits	Unverified	0
A Modern Introduction to Online Learning	Dec 31, 2019	All, Multi-Armed Bandits	Code Available	1
Fair Contextual Multi-Armed Bandits: Theory and Experiments	Dec 13, 2019	Decision Making, Fairness	Unverified	0
Sublinear Optimal Policy Value Estimation in Contextual Bandits	Dec 12, 2019	Multi-Armed Bandits	Unverified	0


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NeuralLinear FullPosterior-MR	Cumulative regret	1.92	—	Unverified
2	Linear FullPosterior-MR	Cumulative regret	1.82	—	Unverified