
Multi-Armed Bandits

Multi-armed bandits refer to the task of allocating a fixed, limited set of resources between competing alternatives in a way that maximizes expected gain. These problems typically involve a trade-off between exploration and exploitation (see the sketch below).

(Image credit: Microsoft Research)
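As a rough illustration of the exploration/exploitation trade-off described above, here is a minimal sketch of an epsilon-greedy policy on Bernoulli arms. The arm means, function name, and parameters are illustrative only and are not taken from any paper listed below.

```python
import random

def epsilon_greedy_bandit(true_means, n_rounds=1000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy agent on Bernoulli arms with the given means."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms          # number of pulls per arm
    estimates = [0.0] * n_arms     # running mean reward per arm
    total_reward = 0.0

    for _ in range(n_rounds):
        # Explore with probability epsilon, otherwise exploit the best current estimate.
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)
        else:
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Draw a Bernoulli reward from the chosen arm and update its running mean.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, total_reward

if __name__ == "__main__":
    est, total = epsilon_greedy_bandit([0.2, 0.5, 0.8])
    print("estimated means:", est, "total reward:", total)
```

With a small epsilon the agent mostly plays the arm it currently believes is best, while the occasional random pull keeps refining the estimates of the other arms.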

Papers

Showing 641–650 of 1262 papers

Title | Status | Hype
Neural Contextual Bandits for Personalized Recommendation | | 0
Neural Contextual Bandits Under Delayed Feedback Constraints | | 0
Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits | | 0
Neural Contextual Bandits with Deep Representation and Shallow Exploration | | 0
Neural Network Retraining for Model Serving | | 0
Neural Risk-sensitive Satisficing in Contextual Bandits | | 0
NeuralUCB: Contextual Bandits with Neural Network-Based Exploration | | 0
No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees | | 0
Nonlinear Sequential Accepts and Rejects for Identification of Top Arms in Stochastic Bandits | | 0
Nonparametric Contextual Bandits in an Unknown Metric Space | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | | Unverified
2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | | Unverified
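The metric in this table, cumulative regret, is typically defined as the gap between the reward of always playing the best arm and the reward actually collected. Below is a minimal sketch of how it could be computed from a simulation trace; the function and variable names are illustrative and not part of any benchmark code.

```python
def cumulative_regret(chosen_means, best_mean):
    """Expected cumulative regret: sum over rounds of (best arm mean - chosen arm mean).

    chosen_means: true mean of the arm chosen at each round.
    best_mean: true mean of the optimal arm.
    """
    return sum(best_mean - m for m in chosen_means)

# Example: over three rounds the agent plays arms with means 0.5, 0.8, 0.8,
# while the optimal arm has mean 0.8, giving cumulative regret 0.3.
print(cumulative_regret([0.5, 0.8, 0.8], best_mean=0.8))
```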