Best-of-Both-Worlds Algorithms for Linear Contextual Bandits Dec 24, 2023 Multi-Armed Bandits
— Unverified 00 Distributed Thompson Sampling Dec 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 00 An Empirical Evaluation of Thompson Sampling Dec 1, 2011 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Best Arm Identification under Additive Transfer Bandits Dec 8, 2021 Multi-Armed Bandits Transfer Learning
— Unverified 00 Multi-player Multi-armed Bandits for Stable Allocation in Heterogeneous Ad-Hoc Networks Dec 24, 2018 channel selection Multi-Armed Bandits
— Unverified 00 Best Arm Identification in Stochastic Bandits: Beyond β-optimality Jan 10, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 00 An Empirical Evaluation of Federated Contextual Bandit Algorithms Mar 17, 2023 Federated Learning Multi-Armed Bandits
— Unverified 00 Best Arm Identification in Restless Markov Multi-Armed Bandits Mar 29, 2022 Multi-Armed Bandits
— Unverified 00 Distributed Cooperative Decision Making in Multi-agent Multi-armed Bandits Mar 3, 2020 Decision Making Multi-Armed Bandits
— Unverified 00 Best arm identification in multi-armed bandits with delayed feedback Mar 29, 2018 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 00 Best Arm Identification in Linked Bandits Nov 19, 2018 Multi-Armed Bandits
— Unverified 00 Discrete Choice Multi-Armed Bandits Oct 1, 2023 Discrete Choice Models Multi-Armed Bandits
— Unverified 00 Best-Arm Identification in Correlated Multi-Armed Bandits Sep 10, 2021 Multi-Armed Bandits
— Unverified 00 An Efficient Algorithm for Deep Stochastic Contextual Bandits Apr 12, 2021 Multi-Armed Bandits Stochastic Optimization
— Unverified 00 Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds Mar 1, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 Active Reinforcement Learning: Observing Rewards at a Cost Nov 13, 2020 Multi-Armed Bandits reinforcement-learning
— Unverified 00 Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits Oct 8, 2024 Change Detection Multi-Armed Bandits
— Unverified 00 Diffusion Models Meet Contextual Bandits with Large Action Spaces Feb 15, 2024 Efficient Exploration Multi-Armed Bandits
— Unverified 00 Disentangling Exploration from Exploitation Apr 29, 2024 Disentanglement Multi-Armed Bandits
— Unverified 00 Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication Apr 12, 2019 Multi-Armed Bandits
— Unverified 00 Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme Jun 11, 2020 Multi-Armed Bandits
— Unverified 00 Distributed Differential Privacy in Multi-Armed Bandits Jun 12, 2022 Multi-Armed Bandits
— Unverified 00 Distributed Exploration in Multi-Armed Bandits Nov 4, 2013 Multi-Armed Bandits
— Unverified 00 Diffusion Approximations for Thompson Sampling May 19, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Differential Privacy for Multi-armed Bandits: What Is It and What Is Its Cost? May 29, 2019 Multi-Armed Bandits
— Unverified 00 Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints Jan 21, 2024 Multi-Armed Bandits Multi-Task Learning
— Unverified 00 Distributed Online Learning via Cooperative Contextual Bandits Aug 21, 2013 Event Detection Multi-Armed Bandits
— Unverified 00 Distributed Optimization via Kernelized Multi-armed Bandits Dec 7, 2023 Decision Making Distributed Optimization
— Unverified 00 Efficient Prompt Optimization Through the Lens of Best Arm Identification Feb 15, 2024 Instruction Following Multi-Armed Bandits
— Unverified 00 An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives Jun 10, 2015 Multi-Armed Bandits Open-Ended Question Answering
— Unverified 00 Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits Jan 1, 2020 Multi-Armed Bandits
— Unverified 00 Distributionally Robust Batch Contextual Bandits Jun 10, 2020 Multi-Armed Bandits
— Unverified 00 Differentially Private Multi-Armed Bandits in the Shuffle Model Jun 5, 2021 Multi-Armed Bandits
— Unverified 00 Distribution-Dependent Rates for Multi-Distribution Learning Dec 20, 2023 Multi-Armed Bandits
— Unverified 00 Diversify and Conquer: Bandits and Diversity for an Enhanced E-commerce Homepage Experience Sep 25, 2023 Diversity Multi-Armed Bandits
— Unverified 00 Diversity-Based Recruitment in Crowdsensing By Combinatorial Multi-Armed Bandits Dec 25, 2023 Diversity Multi-Armed Bandits
— Unverified 00 Differentially Private Kernelized Contextual Bandits Jan 13, 2025 Multi-Armed Bandits
— Unverified 00 DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback Oct 7, 2024 Multi-Armed Bandits Sequential Decision Making
— Unverified 00 Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits Sep 15, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Online Multi-Armed Bandits with Adaptive Inference Feb 25, 2021 Causal Inference Decision Making
— Unverified 00 Be Greedy in Multi-Armed Bandits Jan 4, 2021 Multi-Armed Bandits
— Unverified 00 Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards Jun 1, 2023 Multi-Armed Bandits reinforcement-learning
— Unverified 00 Meta-Learning Bandit Policies by Gradient Ascent Jun 9, 2020 Meta-Learning Multi-Armed Bandits
— Unverified 00 Doubly robust off-policy evaluation with shrinkage Jul 22, 2019 Model Selection Multi-Armed Bandits
— Unverified 00 Beam Learning -- Using Machine Learning for Finding Beam Directions Jun 11, 2019 BIG-bench Machine Learning Multi-Armed Bandits
— Unverified 00 Doubly Robust Policy Evaluation and Optimization Mar 10, 2015 Decision Making Multi-Armed Bandits
— Unverified 00 A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits Aug 27, 2019 Change Detection Multi-Armed Bandits
— Unverified 00 Designing Truthful Contextual Multi-Armed Bandits based Sponsored Search Auctions Feb 26, 2020 Multi-Armed Bandits
— Unverified 00 Designing an Interpretable Interface for Contextual Bandits Sep 23, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 BEACON: Balancing Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes Jun 19, 2024 Multi-Armed Bandits Nutrition
— Unverified 00