SOTAVerified|Agents Browse Leaderboard About

Multi-Armed Bandits

Multi-armed bandits refer to a task where a fixed amount of resources must be allocated between competing resources that maximizes expected gain. Typically these problems involve an exploration/exploitation trade-off.

( Image credit: Microsoft Research )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 611–620 of 1262 papers

Title	Date	Tasks	Status
Efficient Kernel UCB for Contextual Bandits	Feb 11, 2022	Computational EfficiencyMulti-Armed Bandits	CodeCode Available
Shuffle Private Linear Contextual Bandits	Feb 11, 2022	Multi-Armed Bandits	—Unverified
Remote Contextual Bandits	Feb 10, 2022	MarketingMulti-Armed Bandits	—Unverified
Settling the Communication Complexity for Distributed Offline Reinforcement Learning	Feb 10, 2022	Multi-Armed BanditsOffline RL	—Unverified
Smoothed Online Learning is as Easy as Statistical Learning	Feb 9, 2022	Learning TheoryMulti-Armed Bandits	—Unverified
Budgeted Combinatorial Multi-Armed Bandits	Feb 8, 2022	Multi-Armed Bandits	—Unverified
Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits	Feb 3, 2022	counterfactualMulti-Armed Bandits	—Unverified
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts	Feb 2, 2022	Multi-Armed Bandits	—Unverified
Adaptive Experimentation with Delayed Binary Feedback	Feb 2, 2022	Multi-Armed Banditsvalid	CodeCode Available
Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health	Feb 2, 2022	Multi-Armed BanditsScheduling	—Unverified

Show:10 25 50

← PrevPage 62 of 127Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NeuralLinear FullPosterior-MR	Cumulative regret	1.92	—	Unverified
2	Linear FullPosterior-MR	Cumulative regret	1.82	—	Unverified