SOTAVerified

Multi-Armed Bandits

The multi-armed bandit problem is a task in which a fixed budget of resources must be allocated among competing alternatives (the "arms") so as to maximize expected gain, when each alternative's payoff is only partially known at the time of allocation. These problems typically involve an exploration/exploitation trade-off.

(Image credit: Microsoft Research)
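The exploration/exploitation trade-off described above can be illustrated with an epsilon-greedy strategy, one of the simplest bandit algorithms. The sketch below is purely illustrative (it is not taken from any paper listed on this page); the arm means, epsilon value, and Bernoulli reward model are all assumptions chosen for the example.

```python
import random

def epsilon_greedy(true_means, epsilon=0.1, n_rounds=1000, seed=0):
    """Play a K-armed Bernoulli bandit with an epsilon-greedy policy.

    With probability epsilon we explore (pull a random arm); otherwise
    we exploit (pull the arm with the highest estimated mean reward).
    Arm means and reward model here are illustrative assumptions.
    """
    rng = random.Random(seed)
    k = len(true_means)
    counts = [0] * k      # number of pulls per arm
    values = [0.0] * k    # running mean reward estimate per arm
    total_reward = 0.0
    for _ in range(n_rounds):
        if rng.random() < epsilon:
            arm = rng.randrange(k)                        # explore
        else:
            arm = max(range(k), key=lambda a: values[a])  # exploit
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # incremental update of the running mean for the pulled arm
        values[arm] += (reward - values[arm]) / counts[arm]
        total_reward += reward
    return total_reward, counts

reward, counts = epsilon_greedy([0.2, 0.5, 0.8])
```

With enough rounds, the arm with the highest true mean tends to accumulate most of the pulls, while the epsilon fraction of random pulls keeps estimates of the other arms from going stale.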

Papers

Showing 1031–1040 of 1262 papers

| Title | Status | Hype |
|---|---|---|
| Better Algorithms for Stochastic Bandits with Adversarial Corruptions | | 0 |
| AdaLinUCB: Opportunistic Learning for Contextual Bandits | | 0 |
| Equal Opportunity in Online Classification with Partial Feedback | Code | 0 |
| Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting | | 0 |
| A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free | | 0 |
| Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards | | 0 |
| On the bias, risk and consistency of sample means in multi-armed bandits | | 0 |
| Target Tracking for Contextual Bandits: Application to Demand Side Management | | 0 |
| Almost Boltzmann Exploration | | 0 |
| The Assistive Multi-Armed Bandit | Code | 0 |
Page 104 of 127

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | | Unverified |
| 2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | | Unverified |
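The benchmark metric above, cumulative regret, is conventionally the sum over rounds of the gap between the best arm's mean reward and the mean reward of the arm actually pulled. A minimal sketch of that computation (the arm means and pull sequence are made-up illustrative values, not taken from the benchmark):

```python
def cumulative_regret(arm_means, pulls):
    """Cumulative (pseudo-)regret over a sequence of pulls.

    For each pull, add the gap between the best arm's mean reward
    and the mean reward of the arm that was actually chosen.
    """
    best = max(arm_means)
    return sum(best - arm_means[a] for a in pulls)

# Three arms; the first two pulls are suboptimal and incur regret,
# the remaining pulls hit the best arm and incur none.
r = cumulative_regret([0.2, 0.5, 0.8], [0, 1, 2, 2, 2])  # (0.8-0.2) + (0.8-0.5) = 0.9
```

Lower is better: a policy that identifies and sticks with the best arm quickly keeps this sum small, which is why the table ranks models by cumulative regret.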