Off-Policy Evaluation for Large Action Spaces via Embeddings Feb 13, 2022 Multi-Armed Bandits Off-policy evaluation
Code Code Available 25 Hypothesis Generation with Large Language Models Apr 5, 2024 Multi-Armed Bandits
Code Code Available 25 Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model Feb 3, 2022 Multi-Armed Bandits Off-policy evaluation
Code Code Available 25 Generalized Linear Bandits with Local Differential Privacy Jun 7, 2021 Decision Making Multi-Armed Bandits
Code Code Available 15 BanditPAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits Jun 11, 2020 Clustering Multi-Armed Bandits
Code Code Available 15 In-Context Reinforcement Learning for Variable Action Spaces Dec 20, 2023 In-Context Reinforcement Learning Multi-Armed Bandits
Code Code Available 15 Performance-bounded Online Ensemble Learning Method Based on Multi-armed bandits and Its Applications in Real-time Safety Assessment Mar 19, 2025 Ensemble Learning Multi-Armed Bandits
Code Code Available 15 Equitable Restless Multi-Armed Bandits: A General Framework Inspired By Digital Health Aug 17, 2023 Decision Making Fairness
Code Code Available 15 Neural Exploitation and Exploration of Contextual Bandits May 5, 2023 Multi-Armed Bandits Thompson Sampling
Code Code Available 15 Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits Jun 3, 2021 Multi-Armed Bandits Off-policy evaluation
Code Code Available 15 Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling Jul 9, 2022 Bayesian Optimization Decision Making
Code Code Available 15 A unifying framework for generalised Bayesian online learning in non-stationary environments Nov 15, 2024 Continual Learning Multi-Armed Bandits
Code Code Available 15 BanditPAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits Dec 1, 2020 Clustering Multi-Armed Bandits
Code Code Available 15 Anytime-valid off-policy inference for contextual bandits Oct 19, 2022 counterfactual Multi-Armed Bandits
Code Code Available 15 EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits Oct 7, 2021 Multi-Armed Bandits Thompson Sampling
Code Code Available 15 Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling Oct 29, 2018 Collaborative Filtering Decision Making
Code Code Available 15 An empirical evaluation of active inference in multi-armed bandits Jan 21, 2021 BIG-bench Machine Learning Decision Making
Code Code Available 15 Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation Apr 2, 2020 Multi-Armed Bandits
Code Code Available 15 Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits Oct 31, 2022 Multi-Armed Bandits
Code Code Available 15 Multi-agent Dynamic Algorithm Configuration Oct 13, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
Code Code Available 15 Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization Nov 27, 2021 Multi-Armed Bandits
Code Code Available 15 Balans: Multi-Armed Bandits-based Adaptive Large Neighborhood Search for Mixed-Integer Programming Problem Dec 18, 2024 Combinatorial Optimization Multi-Armed Bandits
Code Code Available 15 SplitPlace: AI Augmented Splitting and Placement of Large-Scale Neural Networks in Mobile Edge Environments May 21, 2022 Edge-computing Multi-Armed Bandits
Code Code Available 15 Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL May 10, 2020 Decision Making Lifelong learning
Code Code Available 15 Competing for Shareable Arms in Multi-Player Multi-Armed Bandits May 30, 2023 Multi-Armed Bandits
Code Code Available 15 Carousel Personalization in Music Streaming Apps with Contextual Bandits Sep 14, 2020 Multi-Armed Bandits
Code Code Available 15 Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks May 10, 2021 Efficient Exploration Multi-Armed Bandits
Code Code Available 15 Neural Thompson Sampling Oct 2, 2020 Multi-Armed Bandits Thompson Sampling
Code Code Available 15 LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits Oct 2, 2024 Instruction Following Math
Code Code Available 15 Langevin Monte Carlo for Contextual Bandits Jun 22, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 15 Discovering Minimal Reinforcement Learning Environments Jun 18, 2024 continuous-control Continuous Control
Code Code Available 15 Efficient Contextual Bandits with Continuous Actions Jun 10, 2020 Multi-Armed Bandits
Code Code Available 15 Multiplayer Multi-armed Bandits for Optimal Assignment in Heterogeneous Networks Jan 12, 2019 Multi-Armed Bandits
Code Code Available 15 Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits May 11, 2023 Multi-Armed Bandits
Code Code Available 15 Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces May 8, 2022 BIG-bench Machine Learning Deep Reinforcement Learning
Code Code Available 15 A Modern Introduction to Online Learning Dec 31, 2019 All Multi-Armed Bandits
Code Code Available 15 Federated Multi-Armed Bandits Jan 28, 2021 Federated Learning Multi-Armed Bandits
Code Code Available 15 Combinatorial Multi-armed Bandits for Resource Allocation May 10, 2021 Multi-Armed Bandits
Code Code Available 05 Adapting multi-armed bandits policies to contextual bandits scenarios Nov 11, 2018 Binary Classification Classification
Code Code Available 05 Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents Aug 6, 2024 Multi-Armed Bandits Sensitivity
Code Code Available 05 Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback Sep 4, 2019 Multi-Armed Bandits
Code Code Available 05 Causally Abstracted Multi-armed Bandits Apr 26, 2024 Decision Making Multi-Armed Bandits
Code Code Available 05 Combinatorial Bandits under Strategic Manipulations Feb 25, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks Mar 9, 2023 Decision Making Multi-Armed Bandits
Code Code Available 05 Budgeted Multi-Armed Bandits with Asymmetric Confidence Intervals Jun 12, 2023 Multi-Armed Bandits
Code Code Available 05 Cascading Bandits for Large-Scale Recommendation Problems Mar 17, 2016 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Safe and Adaptive Decision-Making for Optimization of Safety-Critical Systems: The ARTEO Algorithm Nov 10, 2022 Decision Making Decision Making Under Uncertainty
Code Code Available 05 Best Arm Identification with Fixed Budget: A Large Deviation Perspective Dec 19, 2023 Multi-Armed Bandits
Code Code Available 05 Bandit-Based Monte Carlo Optimization for Nearest Neighbors May 21, 2018 Clustering Multi-Armed Bandits
Code Code Available 05 Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards Jun 3, 2019 Multi-Armed Bandits
Code Code Available 05