Finding All ε-Good Arms in Stochastic Bandits Jun 16, 2020 All Multi-Armed Bandits
Code Code Available 0Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback Jan 2, 2019 Multi-Armed Bandits
Code Code Available 0Let's Get It Started: Fostering the Discoverability of New Releases on Deezer Jan 5, 2024 Multi-Armed Bandits
Code Code Available 0Ranking In Generalized Linear Bandits Jun 30, 2022 Diversity Multi-Armed Bandits
Code Code Available 0Myopic Bayesian Design of Experiments via Posterior Sampling and Probabilistic Programming May 25, 2018 Bayesian Inference Multi-Armed Bandits
Code Code Available 0Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits Jul 23, 2021 Multi-Armed Bandits
Code Code Available 0Online Limited Memory Neural-Linear Bandits with Likelihood Matching Feb 7, 2021 Efficient Exploration Multi-Armed Bandits
Code Code Available 0Online Matching: A Real-time Bandit System for Large-scale Recommendations Jul 29, 2023 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Thompson Sampling for Contextual Bandits with Linear Payoffs Sep 15, 2012 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Semiparametric Contextual Bandits Mar 12, 2018 Multi-Armed Bandits
Code Code Available 0Performance-Aware Self-Configurable Multi-Agent Networks: A Distributed Submodular Approach for Simultaneous Coordination and Network Design Sep 2, 2024 Event Detection Multi-Armed Bandits
Code Code Available 0Active Feature Selection for the Mutual Information Criterion Dec 13, 2020 feature selection Multi-Armed Bandits
Code Code Available 0Corralling a Band of Bandit Algorithms Dec 19, 2016 Multi-Armed Bandits
Code Code Available 0Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward Sep 17, 2020 Clustering Decision Making
Code Code Available 0Correlated Multi-armed Bandits with a Latent Random Source Aug 17, 2018 Multi-Armed Bandits
Code Code Available 0A New Bandit Setting Balancing Information from State Evolution and Corrupted Context Nov 16, 2020 Decision Making Efficient Exploration
Code Code Available 0Linear Contextual Bandits with Hybrid Payoff: Revisited Jun 14, 2024 Diversity Multi-Armed Bandits
Code Code Available 0Persistency of Excitation for Robustness of Neural Networks Nov 4, 2019 Multi-Armed Bandits
Code Code Available 0Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits Nov 11, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Aug 21, 2023 Decision Making Multi-Armed Bandits
Code Code Available 0Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms Dec 1, 2020 Multi-Armed Bandits
Code Code Available 0Recurrent Neural-Linear Posterior Sampling for Nonstationary Contextual Bandits Jul 9, 2020 Multi-Armed Bandits
Code Code Available 0A Convex Framework for Confounding Robust Inference Sep 21, 2023 Model Selection Multi-Armed Bandits
Code Code Available 0From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance Feb 7, 2025 Multi-Armed Bandits
Code Code Available 0From Theory to Practice with RAVEN-UCB: Addressing Non-Stationarity in Multi-Armed Bandits through Variance Adaptation Jun 3, 2025 Multi-Armed Bandits
Code Code Available 0Near-Optimal Pure Exploration in Matrix Games: A Generalization of Stochastic Bandits & Dueling Bandits Oct 25, 2023 Multi-Armed Bandits
Code Code Available 0Networked Restless Bandits with Positive Externalities Dec 9, 2022 Multi-Armed Bandits
Code Code Available 0Locally Differentially Private (Contextual) Bandits Learning Jun 1, 2020 Multi-Armed Bandits Privacy Preserving Deep Learning
Code Code Available 0RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Locally Private Nonparametric Contextual Multi-armed Bandits Mar 11, 2025 Decision Making Multi-Armed Bandits
Code Code Available 0Decentralized Cooperative Stochastic Bandits Oct 10, 2018 Multi-Armed Bandits
Code Code Available 0Gaussian Gated Linear Networks Jun 10, 2020 Denoising Density Estimation
Code Code Available 0Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions Oct 24, 2022 Metric Learning Multi-Armed Bandits
Code Code Available 0(Almost) Free Incentivized Exploration from Decentralized Learning Agents Oct 27, 2021 Multi-Armed Bandits
Code Code Available 0Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery Feb 24, 2024 Multi-Armed Bandits
Code Code Available 0MABSplit: Faster Forest Training Using Multi-Armed Bandits Dec 14, 2022 Feature Importance Multi-Armed Bandits
Code Code Available 0Risk-Aware Continuous Control with Neural Contextual Bandits Dec 15, 2023 continuous-control Continuous Control
Code Code Available 0Thompson Sampling for Linearly Constrained Bandits Apr 20, 2020 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Bayesian Optimisation over Multiple Continuous and Categorical Inputs Jun 20, 2019 Bayesian Optimisation Diversity
Code Code Available 0Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling Feb 26, 2018 Decision Making Deep Reinforcement Learning
Code Code Available 0Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits Dec 3, 2023 Causal Inference Multi-Armed Bandits
Code Code Available 0Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints Aug 24, 2023 Diversity Multi-Armed Bandits
Code Code Available 0Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning Dec 1, 2021 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Bayesian Design Principles for Frequentist Sequential Learning Oct 1, 2023 Multi-Armed Bandits reinforcement-learning
Code Code Available 0On Private Online Convex Optimization: Optimal Algorithms in _p-Geometry and High Dimensional Contextual Bandits Jun 16, 2022 Multi-Armed Bandits
Code Code Available 0Piecewise-Stationary Multi-Objective Multi-Armed Bandit with Application to Joint Communications and Sensing Feb 10, 2023 Change Detection Multi-Armed Bandits
Code Code Available 0Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity Apr 10, 2024 Decision Making Meta Reinforcement Learning
Code Code Available 0Thompson Sampling for Multinomial Logit Contextual Bandits Dec 1, 2019 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Sequential Learning of the Pareto Front for Multi-objective Bandits Jan 29, 2025 Multi-Armed Bandits
Code Code Available 0Medoids in almost linear time via multi-armed bandits Nov 2, 2017 Multi-Armed Bandits
Code Code Available 0