SOTAVerified|Agents Browse Leaderboard About

Multi-Armed Bandits

Multi-armed bandits refer to a task where a fixed amount of resources must be allocated between competing resources that maximizes expected gain. Typically these problems involve an exploration/exploitation trade-off.

( Image credit: Microsoft Research )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 911–920 of 1262 papers

Title	Date	Tasks	Status
Model Selection in Contextual Stochastic Bandit Problems	Mar 3, 2020	modelModel Selection	—Unverified
Bounded Regret for Finitely Parameterized Multi-Armed Bandits	Mar 3, 2020	Multi-Armed Bandits	—Unverified
Distributed Cooperative Decision Making in Multi-agent Multi-armed Bandits	Mar 3, 2020	Decision MakingMulti-Armed Bandits	—Unverified
Decentralized Multi-player Multi-armed Bandits with No Collision Information	Feb 29, 2020	Multi-Armed Bandits	—Unverified
Designing Truthful Contextual Multi-Armed Bandits based Sponsored Search Auctions	Feb 26, 2020	Multi-Armed Bandits	—Unverified
Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis	Feb 26, 2020	Multi-Armed Bandits	—Unverified
Bandit Learning with Delayed Impact of Actions	Feb 24, 2020	FairnessMulti-Armed Bandits	—Unverified
The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms	Feb 24, 2020	Multi-Armed Bandits	CodeCode Available
Survey Bandits with Regret Guarantees	Feb 23, 2020	Multi-Armed BanditsSurvey	—Unverified
Online Learning in Contextual Bandits using Gated Linear Networks	Feb 21, 2020	Multi-Armed Bandits	—Unverified

Show:10 25 50

← PrevPage 92 of 127Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NeuralLinear FullPosterior-MR	Cumulative regret	1.92	—	Unverified
2	Linear FullPosterior-MR	Cumulative regret	1.82	—	Unverified