Multi-Armed Bandits

Multi-armed bandits refer to a task in which a fixed amount of resources must be allocated among competing choices so as to maximize expected gain. These problems typically involve an exploration/exploitation trade-off.
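
The exploration/exploitation trade-off can be illustrated with an epsilon-greedy strategy. The sketch below is only an illustration and is not taken from any paper listed on this page; the function name `epsilon_greedy`, the Bernoulli reward model, and the arm means `[0.2, 0.5, 0.8]` are all assumptions chosen for the example.

```python
import random


def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on Bernoulli arms (hypothetical setup for illustration)."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimated mean so far.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: 1 with probability true_means[arm], else 0.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running mean for the pulled arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward


counts, estimates, total_reward = epsilon_greedy([0.2, 0.5, 0.8])
print("pulls per arm:", counts)
print("estimated means:", [round(e, 3) for e in estimates])
```

With a small `epsilon`, most pulls go to the arm that currently looks best, while the occasional random pull keeps the estimates of the other arms from going stale. Larger values explore more at the cost of more pulls on arms already known to be worse.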

Papers

Showing 881–890 of 1262 papers

Title	Date	Tasks	Status
Gaussian Gated Linear Networks	Jun 10, 2020	Denoising, Density Estimation	Code Available
Distributionally Robust Batch Contextual Bandits	Jun 10, 2020	Multi-Armed Bandits	Unverified
Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition	Jun 10, 2020	Multi-Armed Bandits	Unverified
Meta-Learning Bandit Policies by Gradient Ascent	Jun 9, 2020	Meta-Learning, Multi-Armed Bandits	Unverified
Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior	Jun 9, 2020	Multi-Armed Bandits, reinforcement-learning	Code Available
Contextual Bandits with Side-Observations	Jun 6, 2020	Multi-Armed Bandits	Unverified
Concurrent Decentralized Channel Allocation and Access Point Selection using Multi-Armed Bandits in multi BSS WLANs	Jun 5, 2020	Multi-Armed Bandits, Thompson Sampling	Unverified
Locally Differentially Private (Contextual) Bandits Learning	Jun 1, 2020	Multi-Armed Bandits, Privacy Preserving Deep Learning	Code Available
(Locally) Differentially Private Combinatorial Semi-Bandits	Jun 1, 2020	Multi-Armed Bandits, Privacy Preserving	Unverified
To update or not to update? Delayed Nonparametric Bandits with Randomized Allocation	May 26, 2020	Multi-Armed Bandits	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NeuralLinear FullPosterior-MR	Cumulative regret	1.92	—	Unverified
2	Linear FullPosterior-MR	Cumulative regret	1.82	—	Unverified