Multi-Armed Bandits for Minesweeper: Profiting from Exploration-Exploitation Synergy Jul 25, 2020 Multi-Armed Bandits
— Unverified 0Competing Bandits: The Perils of Exploration Under Competition Jul 20, 2020 Multi-Armed Bandits
— Unverified 0Minimax Policy for Heavy-tailed Bandits Jul 20, 2020 Multi-Armed Bandits
— Unverified 0Self-Tuning Bandits over Unknown Covariate-Shifts Jul 16, 2020 Multi-Armed Bandits
— Unverified 0Upper Counterfactual Confidence Bounds: a New Optimism Principle for Contextual Bandits Jul 15, 2020 counterfactual Multi-Armed Bandits
— Unverified 0Quantum exploration algorithms for multi-armed bandits Jul 14, 2020 Multi-Armed Bandits
Code Code Available 0Optimal Learning for Structured Bandits Jul 14, 2020 Decision Making Decision Making Under Uncertainty
Code Code Available 0Fair Algorithms for Multi-Agent Multi-Armed Bandits Jul 13, 2020 Fairness Multi-Armed Bandits
— Unverified 0Recurrent Neural-Linear Posterior Sampling for Nonstationary Contextual Bandits Jul 9, 2020 Multi-Armed Bandits
Code Code Available 0Robust Multi-Agent Multi-Armed Bandits Jul 7, 2020 Distributed Computing Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits with Local Differential Privacy Jul 6, 2020 Multi-Armed Bandits
— Unverified 0Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design Jul 4, 2020 Active Learning Multi-Armed Bandits
— Unverified 0Continuous-Time Multi-Armed Bandits with Controlled Restarts Jun 30, 2020 Multi-Armed Bandits
— Unverified 0Offline Contextual Bandits with Overparameterized Models Jun 27, 2020 Multi-Armed Bandits Q-Learning
Code Code Available 0Online learning with Corrupted context: Corrupted Contextual Bandits Jun 26, 2020 Multi-Armed Bandits
— Unverified 0Approximating a Target Distribution using Weight Queries Jun 24, 2020 Domain Adaptation Multi-Armed Bandits
Code Code Available 0Adaptive Discretization against an Adversary: Lipschitz bandits, Dynamic Pricing, and Auction Tuning Jun 22, 2020 Multi-Armed Bandits
— Unverified 0Towards Tractable Optimism in Model-Based Reinforcement Learning Jun 21, 2020 continuous-control Continuous Control
— Unverified 0Open Problem: Model Selection for Contextual Bandits Jun 19, 2020 model Model Selection
— Unverified 0Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect Jun 18, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting Jun 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Stochastic Network Utility Maximization with Unknown Utilities: Multi-Armed Bandits Approach Jun 17, 2020 Multi-Armed Bandits
— Unverified 0Stochastic Bandits with Linear Constraints Jun 17, 2020 Multi-Armed Bandits
— Unverified 0Constrained regret minimization for multi-criterion multi-armed bandits Jun 17, 2020 Attribute Multi-Armed Bandits
Code Code Available 0Finding All ε-Good Arms in Stochastic Bandits Jun 16, 2020 All Multi-Armed Bandits
Code Code Available 0Non-Stationary Off-Policy Optimization Jun 15, 2020 Multi-Armed Bandits
— Unverified 0Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners Jun 13, 2020 Multi-Armed Bandits
— Unverified 0Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme Jun 11, 2020 Multi-Armed Bandits
— Unverified 0TS-UCB: Improving on Thompson Sampling With Little to No Additional Computation Jun 11, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Bandits with Partially Observable Confounded Data Jun 11, 2020 Multi-Armed Bandits
— Unverified 0Gaussian Gated Linear Networks Jun 10, 2020 Denoising Density Estimation
Code Code Available 0Distributionally Robust Batch Contextual Bandits Jun 10, 2020 Multi-Armed Bandits
— Unverified 0Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition Jun 10, 2020 Multi-Armed Bandits
— Unverified 0Meta-Learning Bandit Policies by Gradient Ascent Jun 9, 2020 Meta-Learning Multi-Armed Bandits
— Unverified 0Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior Jun 9, 2020 Multi-Armed Bandits reinforcement-learning
Code Code Available 0Contextual Bandits with Side-Observations Jun 6, 2020 Multi-Armed Bandits
— Unverified 0Concurrent Decentralized Channel Allocation and Access Point Selection using Multi-Armed Bandits in multi BSS WLANs Jun 5, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Locally Differentially Private (Contextual) Bandits Learning Jun 1, 2020 Multi-Armed Bandits Privacy Preserving Deep Learning
Code Code Available 0(Locally) Differentially Private Combinatorial Semi-Bandits Jun 1, 2020 Multi-Armed Bandits Privacy Preserving
— Unverified 0To update or not to update? Delayed Nonparametric Bandits with Randomized Allocation May 26, 2020 Multi-Armed Bandits
— Unverified 0Greedy Algorithm almost Dominates in Smoothed Contextual Bandits May 19, 2020 Diversity Multi-Armed Bandits
— Unverified 0Neural Network Retraining for Model Serving Apr 29, 2020 model Multi-Armed Bandits
— Unverified 0Learning to Rank in the Position Based Model with Bandit Feedback Apr 27, 2020 Learning-To-Rank Multi-Armed Bandits
— Unverified 0Thompson Sampling for Linearly Constrained Bandits Apr 20, 2020 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Sequential Batch Learning in Finite-Action Linear Contextual Bandits Apr 14, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Power Constrained Bandits Apr 13, 2020 Decision Making Multi-Armed Bandits
Code Code Available 0Exploration with Limited Memory: Streaming Algorithms for Coin Tossing, Noisy Comparisons, and Multi-Armed Bandits Apr 9, 2020 Multi-Armed Bandits
— Unverified 0Hawkes Process Multi-armed Bandits for Disaster Search and Rescue Apr 3, 2020 Multi-Armed Bandits
— Unverified 0Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability Mar 28, 2020 Multi-Armed Bandits regression
— Unverified 0Optimal No-regret Learning in Repeated First-price Auctions Mar 22, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0