SOTAVerified

Multi-Armed Bandits

The multi-armed bandit problem is a task in which a fixed, limited amount of resources must be allocated among competing alternatives so as to maximize expected gain. These problems typically involve an exploration/exploitation trade-off: the agent must balance trying arms whose payoffs are uncertain against repeatedly playing the arm that currently looks best.

(Image credit: Microsoft Research)
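The exploration/exploitation trade-off described above can be sketched with a minimal epsilon-greedy agent on a Bernoulli bandit. This is an illustrative example, not code from any of the papers listed below; the arm reward probabilities and the epsilon value are arbitrary assumptions.

```python
import random

def epsilon_greedy_bandit(arm_means, epsilon=0.1, steps=1000, seed=0):
    """Simulate an epsilon-greedy agent on a Bernoulli multi-armed bandit.

    arm_means: hypothetical true reward probabilities, one per arm.
    With probability epsilon the agent explores (picks a random arm);
    otherwise it exploits the arm with the highest empirical mean so far.
    """
    rng = random.Random(seed)
    n_arms = len(arm_means)
    counts = [0] * n_arms    # number of pulls per arm
    values = [0.0] * n_arms  # empirical mean reward per arm
    total_reward = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: uniform random arm
        else:
            # exploit: arm with best empirical mean (ties go to lowest index)
            arm = max(range(n_arms), key=lambda a: values[a])
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        # incremental update of the running mean for the chosen arm
        values[arm] += (reward - values[arm]) / counts[arm]
        total_reward += reward
    return counts, values, total_reward
```

With a small epsilon, most pulls concentrate on the empirically best arm while occasional random pulls keep estimates of the other arms from going stale.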

Papers

Showing 421-430 of 1262 papers

Title | Status | Hype
Efficient Contextual Bandits with Uninformed Feedback Graphs | | 0
Cost-Efficient Distributed Learning via Combinatorial Multi-Armed Bandits | | 0
Delegating via Quitting Games | | 0
Delay-Adaptive Learning in Generalized Linear Contextual Bandits | | 0
An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits | | 0
Efficient Generalized Low-Rank Tensor Contextual Bandits | | 0
Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems | | 0
Deep Upper Confidence Bound Algorithm for Contextual Bandit Ranking of Information Selection | | 0
Deep Contextual Bandits for Fast Initial Access in mmWave Based User-Centric Ultra-Dense Networks | | 0
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching | | 0
Page 43 of 127

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | | Unverified
2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | | Unverified
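The benchmark metric above, cumulative regret, is commonly defined as the total gap between always playing the best arm and the arms actually chosen. A minimal sketch of that definition (my own illustration, not the benchmark's evaluation code; the arm means and pull sequence are made-up inputs):

```python
def cumulative_regret(arm_means, chosen_arms):
    """Cumulative (pseudo-)regret over a horizon.

    arm_means:   true mean reward of each arm.
    chosen_arms: sequence of arm indices the agent actually pulled.
    Returns the summed per-step gap between the best arm's mean and
    the mean of the arm chosen at each step.
    """
    best = max(arm_means)
    return sum(best - arm_means[a] for a in chosen_arms)
```

A lower value is better: an agent that always plays the optimal arm incurs zero cumulative regret, while every suboptimal pull adds that arm's gap to the total.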