
Multi-Armed Bandits

Multi-armed bandits refer to a class of problems in which a fixed amount of resources must be allocated among competing choices so as to maximize expected gain. These problems typically involve an exploration/exploitation trade-off.

(Image credit: Microsoft Research)
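To make the exploration/exploitation trade-off concrete, below is a minimal epsilon-greedy sketch on simulated Bernoulli arms. It is not taken from any paper listed here; the arm probabilities, horizon, and epsilon value are illustrative assumptions.

import random

def run_epsilon_greedy(arm_probs, horizon=10000, epsilon=0.1, seed=0):
    """Pull arms for `horizon` rounds; explore with probability epsilon, otherwise exploit."""
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    counts = [0] * n_arms      # number of pulls per arm
    values = [0.0] * n_arms    # running mean reward per arm
    total_reward = 0.0
    for _ in range(horizon):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)                        # explore: pick a random arm
        else:
            arm = max(range(n_arms), key=lambda a: values[a])  # exploit: best current estimate
        reward = 1.0 if rng.random() < arm_probs[arm] else 0.0  # Bernoulli reward (assumed environment)
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]    # incremental mean update
        total_reward += reward
    return total_reward, values

if __name__ == "__main__":
    reward, estimates = run_epsilon_greedy([0.3, 0.5, 0.7])   # arm means are assumptions
    print(f"total reward: {reward:.0f}, estimated arm means: {estimates}")

With a small epsilon the agent spends most rounds on the arm it currently believes is best, while still sampling the other arms often enough to correct bad early estimates.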

Papers

Showing 181–190 of 1262 papers (page 19 of 127)

Title | Status | Hype
Latent Bottlenecked Attentive Neural Processes | Code | 0
Learning Contextual Bandits in a Non-stationary Environment | Code | 0
Linear Contextual Bandits with Hybrid Payoff: Revisited | Code | 0
Locally Differentially Private (Contextual) Bandits Learning | Code | 0
Confidence Intervals for Policy Evaluation in Adaptive Experiments | Code | 0
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions | Code | 0
Adaptive Linear Estimating Equations | Code | 0
Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits | Code | 0
Best Arm Identification with Fixed Budget: A Large Deviation Perspective | Code | 0
Decentralized Cooperative Stochastic Bandits | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | - | Unverified
2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | - | Unverified
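The metric in this table, cumulative regret, is the reward lost relative to always pulling the best arm, summed over all rounds. The sketch below is a rough illustration of how it is typically computed; the arm means and pull sequence are made-up assumptions, not the benchmark's actual evaluation code.

def cumulative_regret(arm_probs, pulls):
    """Sum of (best arm's mean reward - pulled arm's mean reward) over all rounds."""
    best = max(arm_probs)
    return sum(best - arm_probs[a] for a in pulls)

if __name__ == "__main__":
    arm_probs = [0.3, 0.5, 0.7]          # true mean reward of each arm (assumed)
    pulls = [0, 2, 2, 1, 2, 2, 2, 0, 2]  # arms chosen by some policy (assumed)
    print(cumulative_regret(arm_probs, pulls))  # (0.7-0.3) + (0.7-0.5) + (0.7-0.3) ≈ 1.0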