Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching Jan 24, 2019 Decision Making Efficient Exploration
— Unverified 0Context-Aware Bandits Oct 12, 2015 Clustering Multi-Armed Bandits
— Unverified 0Deep Contextual Bandits for Fast Initial Access in mmWave Based User-Centric Ultra-Dense Networks Sep 15, 2020 Management Multi-Armed Bandits
— Unverified 0Deep Upper Confidence Bound Algorithm for Contextual Bandit Ranking of Information Selection Oct 8, 2021 Multi-Armed Bandits
— Unverified 0Delay-Adaptive Learning in Generalized Linear Contextual Bandits Mar 11, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Delegating via Quitting Games Apr 20, 2018 Multi-Armed Bandits
— Unverified 0Designing an Interpretable Interface for Contextual Bandits Sep 23, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Designing Truthful Contextual Multi-Armed Bandits based Sponsored Search Auctions Feb 26, 2020 Multi-Armed Bandits
— Unverified 0Meta-Learning Bandit Policies by Gradient Ascent Jun 9, 2020 Meta-Learning Multi-Armed Bandits
— Unverified 0Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards Jun 1, 2023 Multi-Armed Bandits reinforcement-learning
— Unverified 0Differentially Private Kernelized Contextual Bandits Jan 13, 2025 Multi-Armed Bandits
— Unverified 0Differentially Private Multi-Armed Bandits in the Shuffle Model Jun 5, 2021 Multi-Armed Bandits
— Unverified 0Differential Privacy for Multi-armed Bandits: What Is It and What Is Its Cost? May 29, 2019 Multi-Armed Bandits
— Unverified 0Diffusion Approximations for Thompson Sampling May 19, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Diffusion Models Meet Contextual Bandits with Large Action Spaces Feb 15, 2024 Efficient Exploration Multi-Armed Bandits
— Unverified 0Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits Oct 8, 2024 Change Detection Multi-Armed Bandits
— Unverified 0Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Discrete Choice Multi-Armed Bandits Oct 1, 2023 Discrete Choice Models Multi-Armed Bandits
— Unverified 0Disentangling Exploration from Exploitation Apr 29, 2024 Disentanglement Multi-Armed Bandits
— Unverified 0Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication Apr 12, 2019 Multi-Armed Bandits
— Unverified 0Asymptotic Instance-Optimal Algorithms for Interactive Decision Making Jun 6, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Distributed Differential Privacy in Multi-Armed Bandits Jun 12, 2022 Multi-Armed Bandits
— Unverified 0Distributed Exploration in Multi-Armed Bandits Nov 4, 2013 Multi-Armed Bandits
— Unverified 0Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget Nov 27, 2022 Attribute Multi-Armed Bandits
— Unverified 0Multi-player Multi-armed Bandits for Stable Allocation in Heterogeneous Ad-Hoc Networks Dec 24, 2018 channel selection Multi-Armed Bandits
— Unverified 0Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints Jan 21, 2024 Multi-Armed Bandits Multi-Task Learning
— Unverified 0Distributed Online Learning via Cooperative Contextual Bandits Aug 21, 2013 Event Detection Multi-Armed Bandits
— Unverified 0Distributed Optimization via Kernelized Multi-armed Bandits Dec 7, 2023 Decision Making Distributed Optimization
— Unverified 0Distributed Thompson Sampling Dec 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0An Empirical Evaluation of Thompson Sampling Dec 1, 2011 Multi-Armed Bandits Thompson Sampling
— Unverified 0Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits Jan 1, 2020 Multi-Armed Bandits
— Unverified 0Distributionally Robust Batch Contextual Bandits Jun 10, 2020 Multi-Armed Bandits
— Unverified 0Distribution-dependent and Time-uniform Bounds for Piecewise i.i.d Bandits May 30, 2019 Multi-Armed Bandits
— Unverified 0Distribution-Dependent Rates for Multi-Distribution Learning Dec 20, 2023 Multi-Armed Bandits
— Unverified 0Diversify and Conquer: Bandits and Diversity for an Enhanced E-commerce Homepage Experience Sep 25, 2023 Diversity Multi-Armed Bandits
— Unverified 0Diversity-Based Recruitment in Crowdsensing By Combinatorial Multi-Armed Bandits Dec 25, 2023 Diversity Multi-Armed Bandits
— Unverified 0Diversity-Driven Selection of Exploration Strategies in Multi-Armed Bandits Aug 23, 2018 Diversity Multi-Armed Bandits
— Unverified 0DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback Oct 7, 2024 Multi-Armed Bandits Sequential Decision Making
— Unverified 0Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits Sep 15, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Online Multi-Armed Bandits with Adaptive Inference Feb 25, 2021 Causal Inference Decision Making
— Unverified 0Doubly High-Dimensional Contextual Bandits: An Interpretable Model for Joint Assortment-Pricing Sep 14, 2023 Multi-Armed Bandits
— Unverified 0Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback Mar 15, 2025 Multi-Armed Bandits
— Unverified 0A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option Mar 6, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Doubly robust off-policy evaluation with shrinkage Jul 22, 2019 Model Selection Multi-Armed Bandits
— Unverified 0Doubly-Robust Off-Policy Evaluation with Estimated Logging Policy Apr 2, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Doubly Robust Policy Evaluation and Optimization Mar 10, 2015 Decision Making Multi-Armed Bandits
— Unverified 0Boltzmann Exploration Done Right May 29, 2017 Decision Making Decision Making Under Uncertainty
— Unverified 0Active Search for High Recall: a Non-Stationary Extension of Thompson Sampling Dec 27, 2017 Multi-Armed Bandits Thompson Sampling
— Unverified 0Adapting to Misspecification in Contextual Bandits Jul 12, 2021 Multi-Armed Bandits regression
— Unverified 0Dynamic Global Sensitivity for Differentially Private Contextual Bandits Aug 30, 2022 Interactive Recommendation Multi-Armed Bandits
— Unverified 0