SOTAVerified|Agents Browse Leaderboard About

Multi-Armed Bandits

Multi-armed bandits refer to a task where a fixed amount of resources must be allocated between competing resources that maximizes expected gain. Typically these problems involve an exploration/exploitation trade-off.

( Image credit: Microsoft Research )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 281–290 of 1262 papers

Title	Date	Tasks	Status
Contextual bandits with concave rewards, and an application to fair ranking	Oct 18, 2022	FairnessMulti-Armed Bandits	—Unverified
Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting	Feb 5, 2019	Multi-Armed Bandits	—Unverified
Contextual Bandits with Cross-learning	Sep 25, 2018	Multi-Armed Bandits	—Unverified
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards	Aug 22, 2024	Language ModelingLanguage Modelling	—Unverified
Asymptotic Randomised Control with applications to bandits	Oct 14, 2020	ARCMulti-Armed Bandits	—Unverified
Contextual Bandits with Knapsacks for a Conversion Model	Jun 1, 2022	modelMulti-Armed Bandits	—Unverified
Contextual Bandits with Latent Confounders: An NMF Approach	Jun 1, 2016	Matrix CompletionMulti-Armed Bandits	—Unverified
Contextual Bandits with Non-Stationary Correlated Rewards for User Association in MmWave Vehicular Networks	Oct 8, 2024	Multi-Armed BanditsThompson Sampling	—Unverified
Contextual Bandits with Online Neural Regression	Dec 12, 2023	Multi-Armed Banditsregression	—Unverified
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation	Jun 12, 2024	Federated LearningMulti-Armed Bandits	—Unverified

Show:10 25 50

← PrevPage 29 of 127Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NeuralLinear FullPosterior-MR	Cumulative regret	1.92	—	Unverified
2	Linear FullPosterior-MR	Cumulative regret	1.82	—	Unverified