Finding All -Good Arms in Stochastic Bandits Dec 1, 2020 All Multi-Armed Bandits
— Unverified 0A Tractable Online Learning Algorithm for the Multinomial Logit Contextual Bandit Nov 28, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Resonance: Replacing Software Constants with Context-Aware Models in Real-time Communication Nov 23, 2020 Friction Multi-Armed Bandits
— Unverified 0Fully Gap-Dependent Bounds for Multinomial Logit Bandit Nov 19, 2020 Multi-Armed Bandits
— Unverified 0A New Bandit Setting Balancing Information from State Evolution and Corrupted Context Nov 16, 2020 Decision Making Efficient Exploration
Code Code Available 0Reward Biased Maximum Likelihood Estimation for Reinforcement Learning Nov 16, 2020 Multi-Armed Bandits reinforcement-learning
— Unverified 0Metric-Free Individual Fairness with Cooperative Contextual Bandits Nov 13, 2020 Decision Making Fairness
— Unverified 0Improving Offline Contextual Bandits with Distributional Robustness Nov 13, 2020 counterfactual Multi-Armed Bandits
— Unverified 0Active Reinforcement Learning: Observing Rewards at a Cost Nov 13, 2020 Multi-Armed Bandits reinforcement-learning
— Unverified 0Asymptotic Convergence of Thompson Sampling Nov 8, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Multi-armed Bandits with Cost Subsidy Nov 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Towards Fundamental Limits of Multi-armed Bandits with Random Walk Feedback Nov 3, 2020 Multi-Armed Bandits Recommendation Systems
— Unverified 0On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications Nov 2, 2020 Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits with Censored Consumption of Resources Nov 2, 2020 Multi-Armed Bandits
— Unverified 0Resource Allocation in Multi-armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism Oct 31, 2020 Distributed Computing Multi-Armed Bandits
— Unverified 0Learning to Actively Learn: A Robust Approach Oct 29, 2020 Active Learning Meta-Learning
— Unverified 0Tractable contextual bandits beyond realizability Oct 25, 2020 Multi-Armed Bandits
— Unverified 0Optimal Algorithms for Stochastic Multi-Armed Bandits with Heavy Tailed Rewards Oct 24, 2020 Multi-Armed Bandits
— Unverified 0Online Semi-Supervised Learning with Bandit Feedback Oct 23, 2020 Imputation Multi-Armed Bandits
— Unverified 0Online Algorithm for Unsupervised Sequential Selection with Contextual Information Oct 23, 2020 Multi-Armed Bandits
— Unverified 0Quantile Bandits for Best Arms Identification Oct 22, 2020 Decision Making Multi-Armed Bandits
Code Code Available 0Achieving User-Side Fairness in Contextual Bandits Oct 22, 2020 Fairness Multi-Armed Bandits
— Unverified 0DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees Oct 19, 2020 Attribute Decision Making
— Unverified 0Stochastic Bandits with Vector Losses: Minimizing ^-Norm of Relative Losses Oct 15, 2020 Multi-Armed Bandits Recommendation Systems
— Unverified 0Asymptotic Randomised Control with applications to bandits Oct 14, 2020 ARC Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits with Dependent Arms Oct 13, 2020 Multi-Armed Bandits
— Unverified 0Adapting to Delays and Data in Adversarial Multi-Armed Bandits Oct 12, 2020 Multi-Armed Bandits
— Unverified 0Online and Distribution-Free Robustness: Regression and Contextual Bandits with Huber Contamination Oct 8, 2020 Adversarial Robustness Multi-Armed Bandits
— Unverified 0Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective Oct 7, 2020 Active Learning Multi-Armed Bandits
— Unverified 0CorrAttack: Black-box Adversarial Attack with Structured Search Oct 3, 2020 Adversarial Attack Bayesian Optimization
— Unverified 0Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon Sep 28, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Contextual Bandits for adapting to changing User preferences over time Sep 21, 2020 Incremental Learning Multi-Armed Bandits
— Unverified 0Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms Sep 20, 2020 Multi-Armed Bandits reinforcement-learning
— Unverified 0Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward Sep 17, 2020 Clustering Decision Making
Code Code Available 0Partial Bandit and Semi-Bandit: Making the Most Out of Scarce Users' Feedback Sep 16, 2020 Multi-Armed Bandits Recommendation Systems
— Unverified 0Thompson Sampling for Unsupervised Sequential Selection Sep 16, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Deep Contextual Bandits for Fast Initial Access in mmWave Based User-Centric Ultra-Dense Networks Sep 15, 2020 Management Multi-Armed Bandits
— Unverified 0Dual-Mandate Patrols: Multi-Armed Bandits for Green Security Sep 14, 2020 Multi-Armed Bandits
Code Code Available 0VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning Sep 14, 2020 Deep Reinforcement Learning Multi-Armed Bandits
Code Code Available 0Unifying Clustered and Non-stationary Bandits Sep 5, 2020 Change Detection Clustering
— Unverified 0Statistically Robust, Risk-Averse Best Arm Identification in Multi-Armed Bandits Aug 28, 2020 Multi-Armed Bandits
— Unverified 0Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits Aug 27, 2020 Decision Making Marketing
— Unverified 0A Sleeping, Recovering Bandit Algorithm for Optimizing Recurring Notifications Aug 23, 2020 Multi-Armed Bandits
— Unverified 0Contextual Bandits for Advertising Budget Allocation Aug 22, 2020 Marketing Multi-Armed Bandits
— Unverified 0Offline Contextual Multi-armed Bandits for Mobile Health Interventions: A Case Study on Emotion Regulation Aug 21, 2020 Management Multi-Armed Bandits
— Unverified 0Using Subjective Logic to Estimate Uncertainty in Multi-Armed Bandit Problems Aug 17, 2020 Decision Making Multi-Armed Bandits
Code Code Available 0Kernel Methods for Cooperative Multi-Agent Contextual Bandits Aug 14, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Lenient Regret for Multi-Armed Bandits Aug 10, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0A framework for optimizing COVID-19 testing policy using a Multi Armed Bandit approach Jul 28, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Greedy Bandits with Sampled Context Jul 27, 2020 Decision Making Multi-Armed Bandits
— Unverified 0