Fair Algorithms for Multi-Agent Multi-Armed Bandits Jul 13, 2020 Fairness Multi-Armed Bandits
— Unverified 0Recurrent Neural-Linear Posterior Sampling for Nonstationary Contextual Bandits Jul 9, 2020 Multi-Armed Bandits
Code Code Available 0Robust Multi-Agent Multi-Armed Bandits Jul 7, 2020 Distributed Computing Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits with Local Differential Privacy Jul 6, 2020 Multi-Armed Bandits
— Unverified 0Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design Jul 4, 2020 Active Learning Multi-Armed Bandits
— Unverified 0Continuous-Time Multi-Armed Bandits with Controlled Restarts Jun 30, 2020 Multi-Armed Bandits
— Unverified 0Offline Contextual Bandits with Overparameterized Models Jun 27, 2020 Multi-Armed Bandits Q-Learning
Code Code Available 0Online learning with Corrupted context: Corrupted Contextual Bandits Jun 26, 2020 Multi-Armed Bandits
— Unverified 0Approximating a Target Distribution using Weight Queries Jun 24, 2020 Domain Adaptation Multi-Armed Bandits
Code Code Available 0Adaptive Discretization against an Adversary: Lipschitz bandits, Dynamic Pricing, and Auction Tuning Jun 22, 2020 Multi-Armed Bandits
— Unverified 0Towards Tractable Optimism in Model-Based Reinforcement Learning Jun 21, 2020 continuous-control Continuous Control
— Unverified 0Open Problem: Model Selection for Contextual Bandits Jun 19, 2020 model Model Selection
— Unverified 0Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect Jun 18, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting Jun 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Stochastic Bandits with Linear Constraints Jun 17, 2020 Multi-Armed Bandits
— Unverified 0Constrained regret minimization for multi-criterion multi-armed bandits Jun 17, 2020 Attribute Multi-Armed Bandits
Code Code Available 0Stochastic Network Utility Maximization with Unknown Utilities: Multi-Armed Bandits Approach Jun 17, 2020 Multi-Armed Bandits
— Unverified 0Finding All ε-Good Arms in Stochastic Bandits Jun 16, 2020 All Multi-Armed Bandits
Code Code Available 0Non-Stationary Off-Policy Optimization Jun 15, 2020 Multi-Armed Bandits
— Unverified 0Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners Jun 13, 2020 Multi-Armed Bandits
— Unverified 0Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme Jun 11, 2020 Multi-Armed Bandits
— Unverified 0BanditPAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits Jun 11, 2020 Clustering Multi-Armed Bandits
Code Code Available 1Bandits with Partially Observable Confounded Data Jun 11, 2020 Multi-Armed Bandits
— Unverified 0TS-UCB: Improving on Thompson Sampling With Little to No Additional Computation Jun 11, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Efficient Contextual Bandits with Continuous Actions Jun 10, 2020 Multi-Armed Bandits
Code Code Available 1Gaussian Gated Linear Networks Jun 10, 2020 Denoising Density Estimation
Code Code Available 0Distributionally Robust Batch Contextual Bandits Jun 10, 2020 Multi-Armed Bandits
— Unverified 0Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition Jun 10, 2020 Multi-Armed Bandits
— Unverified 0Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior Jun 9, 2020 Multi-Armed Bandits reinforcement-learning
Code Code Available 0Meta-Learning Bandit Policies by Gradient Ascent Jun 9, 2020 Meta-Learning Multi-Armed Bandits
— Unverified 0Contextual Bandits with Side-Observations Jun 6, 2020 Multi-Armed Bandits
— Unverified 0Concurrent Decentralized Channel Allocation and Access Point Selection using Multi-Armed Bandits in multi BSS WLANs Jun 5, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0(Locally) Differentially Private Combinatorial Semi-Bandits Jun 1, 2020 Multi-Armed Bandits Privacy Preserving
— Unverified 0Locally Differentially Private (Contextual) Bandits Learning Jun 1, 2020 Multi-Armed Bandits Privacy Preserving Deep Learning
Code Code Available 0To update or not to update? Delayed Nonparametric Bandits with Randomized Allocation May 26, 2020 Multi-Armed Bandits
— Unverified 0Greedy Algorithm almost Dominates in Smoothed Contextual Bandits May 19, 2020 Diversity Multi-Armed Bandits
— Unverified 0Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL May 10, 2020 Decision Making Lifelong learning
Code Code Available 1Neural Network Retraining for Model Serving Apr 29, 2020 model Multi-Armed Bandits
— Unverified 0Learning to Rank in the Position Based Model with Bandit Feedback Apr 27, 2020 Learning-To-Rank Multi-Armed Bandits
— Unverified 0Thompson Sampling for Linearly Constrained Bandits Apr 20, 2020 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Sequential Batch Learning in Finite-Action Linear Contextual Bandits Apr 14, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Power Constrained Bandits Apr 13, 2020 Decision Making Multi-Armed Bandits
Code Code Available 0Exploration with Limited Memory: Streaming Algorithms for Coin Tossing, Noisy Comparisons, and Multi-Armed Bandits Apr 9, 2020 Multi-Armed Bandits
— Unverified 0Hawkes Process Multi-armed Bandits for Disaster Search and Rescue Apr 3, 2020 Multi-Armed Bandits
— Unverified 0Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation Apr 2, 2020 Multi-Armed Bandits
Code Code Available 1Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability Mar 28, 2020 Multi-Armed Bandits regression
— Unverified 0Optimal No-regret Learning in Repeated First-price Auctions Mar 22, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Self-Supervised Contextual Bandits in Computer Vision Mar 18, 2020 Clustering Colorization
— Unverified 0Learning and Fairness in Energy Harvesting: A Maximin Multi-Armed Bandits Approach Mar 13, 2020 Fairness Multi-Armed Bandits
— Unverified 0Delay-Adaptive Learning in Generalized Linear Contextual Bandits Mar 11, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0