Multi-Armed Bandits with Correlated Arms Nov 6, 2019 Multi-Armed Bandits
Code Code Available 0Jump Starting Bandits with LLM-Generated Prior Knowledge Jun 27, 2024 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes Sep 5, 2019 Multi-Armed Bandits
Code Code Available 0Kernel Conditional Moment Constraints for Confounding Robust Inference Feb 26, 2023 Multi-Armed Bandits Sensitivity
Code Code Available 0Q-Learning Lagrange Policies for Multi-Action Restless Bandits Jun 22, 2021 Multi-Armed Bandits Q-Learning
Code Code Available 0Constrained regret minimization for multi-criterion multi-armed bandits Jun 17, 2020 Attribute Multi-Armed Bandits
Code Code Available 0Top-k eXtreme Contextual Bandits with Arm Hierarchy Feb 15, 2021 Computational Efficiency Extreme Multi-Label Classification
Code Code Available 0Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework Jan 31, 2022 Bayesian Inference Multi-Armed Bandits
Code Code Available 0Generalized Linear Bandits with Limited Adaptivity Apr 10, 2024 Multi-Armed Bandits
Code Code Available 0Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards Apr 28, 2023 Multi-Armed Bandits Thompson Sampling
Code Code Available 0On-line Adaptative Curriculum Learning for GANs Jul 31, 2018 Multi-Armed Bandits Stochastic Optimization
Code Code Available 0Bandit-Based Monte Carlo Optimization for Nearest Neighbors May 21, 2018 Clustering Multi-Armed Bandits
Code Code Available 0Quantile Bandits for Best Arms Identification Oct 22, 2020 Decision Making Multi-Armed Bandits
Code Code Available 0Adaptive Linear Estimating Equations Jul 14, 2023 Multi-Armed Bandits
Code Code Available 0Latent Bottlenecked Attentive Neural Processes Nov 15, 2022 Meta-Learning Multi-Armed Bandits
Code Code Available 0Multi-armed Bandits with Missing Outcome Nov 8, 2024 Decision Making Multi-Armed Bandits
Code Code Available 0Multi-Armed Bandits with Network Interference May 28, 2024 Multi-Armed Bandits
Code Code Available 0Residual Loss Prediction: Reinforcement Learning With No Incremental Feedback Jan 1, 2018 Multi-Armed Bandits Prediction
Code Code Available 0Multi-facet Contextual Bandits: A Neural Network Perspective Jun 6, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Smoothness-Adaptive Contextual Bandits Oct 22, 2019 Decision Making Multi-Armed Bandits
Code Code Available 0Fairness of Exposure in Online Restless Multi-armed Bandits Feb 9, 2024 Fairness Multi-Armed Bandits
Code Code Available 0Learning Contextual Bandits in a Non-stationary Environment May 23, 2018 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Falcon: Fair Active Learning using Multi-armed Bandits Jan 23, 2024 Active Learning Attribute
Code Code Available 0Optimistic Whittle Index Policy: Online Learning for Restless Bandits May 30, 2022 Multi-Armed Bandits
Code Code Available 0Quantum exploration algorithms for multi-armed bandits Jul 14, 2020 Multi-Armed Bandits
Code Code Available 0Contextual bandits with entropy-based human feedback Feb 12, 2025 Multi-Armed Bandits
Code Code Available 0Fast Beam Alignment via Pure Exploration in Multi-armed Bandits Oct 23, 2022 Multi-Armed Bandits
Code Code Available 0Contextual Bandits with Large Action Spaces: Made Practical Jul 12, 2022 Decision Making Multi-Armed Bandits
Code Code Available 0VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning Sep 14, 2020 Deep Reinforcement Learning Multi-Armed Bandits
Code Code Available 0Transfer Learning in Latent Contextual Bandits with Covariate Shift Through Causal Transportability Feb 27, 2025 Causal Inference Multi-Armed Bandits
Code Code Available 0Quantum Natural Policy Gradients: Towards Sample-Efficient Reinforcement Learning Apr 26, 2023 Multi-Armed Bandits reinforcement-learning
Code Code Available 0Output-Weighted Sampling for Multi-Armed Bandits with Extreme Payoffs Feb 19, 2021 Decision Making Gaussian Processes
Code Code Available 0Offline Contextual Bandits with Overparameterized Models Jun 27, 2020 Multi-Armed Bandits Q-Learning
Code Code Available 0Solving Inverse Problem for Multi-armed Bandits via Convex Optimization Jan 31, 2025 Multi-Armed Bandits
Code Code Available 0Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces Jul 12, 2022 continuous-control Continuous Control
Code Code Available 0Learning Structural Weight Uncertainty for Sequential Decision-Making Dec 30, 2017 Decision Making Multi-Armed Bandits
Code Code Available 0Nonstationary Continuum-Armed Bandit Strategies for Automated Trading in a Simulated Financial Market Aug 4, 2022 Bayesian Optimisation Bayesian Optimization
Code Code Available 0Contextual Bandits with Stochastic Experts Feb 23, 2018 Multi-Armed Bandits
Code Code Available 0Empirical analysis of representation learning and exploration in neural kernel bandits Nov 5, 2021 Bayesian Inference Decision Making
Code Code Available 0Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards Jun 3, 2019 Multi-Armed Bandits
Code Code Available 0Federated Multi-armed Bandits with Personalization Feb 25, 2021 Federated Learning Multi-Armed Bandits
Code Code Available 0Federated Neural Bandits May 28, 2022 Multi-Armed Bandits
Code Code Available 0Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior Jun 9, 2020 Multi-Armed Bandits reinforcement-learning
Code Code Available 0Thompson Sampling for Bandit Learning in Matching Markets Apr 26, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Variational inference for the multi-armed contextual bandit Sep 10, 2017 Multi-Armed Bandits Reinforcement Learning
Code Code Available 0AC-Band: A Combinatorial Bandit-Based Approach to Algorithm Configuration Dec 1, 2022 Multi-Armed Bandits
Code Code Available 0PageRank Bandits for Link Prediction Nov 3, 2024 Decision Making Graph Learning
Code Code Available 0Stochastic Rising Bandits Dec 7, 2022 Model Selection Multi-Armed Bandits
Code Code Available 0Contextual Linear Bandits under Noisy Features: Towards Bayesian Oracles Mar 3, 2017 Multi-Armed Bandits
Code Code Available 0Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits Nov 8, 2024 Computational Efficiency Multi-Armed Bandits
Code Code Available 0