Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards Jun 3, 2019 Multi-Armed Bandits
Code Available 0 Model selection for contextual bandits Jun 3, 2019 model Model Selection
Code Available 0 Equipping Experts/Bandits with Long-term Memory May 30, 2019 Multi-Armed Bandits
— Unverified 0 Rarely-switching linear bandits: optimization of causal effects for the real world May 30, 2019 Causal Inference Multi-Armed Bandits
— Unverified 0 Multi-Objective Generalized Linear Bandits May 30, 2019 Multi-Armed Bandits
— Unverified 0 Distribution-dependent and Time-uniform Bounds for Piecewise i.i.d Bandits May 30, 2019 Multi-Armed Bandits
— Unverified 0 Differential Privacy for Multi-armed Bandits: What Is It and What Is Its Cost? May 29, 2019 Multi-Armed Bandits
— Unverified 0 Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems May 29, 2019 Multi-Armed Bandits Thompson Sampling
Code Available 0 Top-k Combinatorial Bandits with Full-Bandit Feedback May 28, 2019 Multi-Armed Bandits
— Unverified 0 Are sample means in multi-armed bandits positively or negatively biased? May 27, 2019 Multi-Armed Bandits Selection bias
— Unverified 0 Achieving Fairness in Stochastic Multi-armed Bandit Problem May 27, 2019 Fairness Multi-Armed Bandits
— Unverified 0 OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual bandits May 24, 2019 Multi-Armed Bandits
— Unverified 0 Data Poisoning Attacks on Stochastic Bandits May 16, 2019 Data Poisoning Multi-Armed Bandits
— Unverified 0 Lessons from Contextual Bandit Learning in a Customer Support Bot May 6, 2019 Information Retrieval Multi-Armed Bandits
— Unverified 0 Tight Regret Bounds for Infinite-armed Linear Contextual Bandits May 4, 2019 Decision Making Multi-Armed Bandits
— Unverified 0 Meta-learners' learning dynamics are unlike learners' May 3, 2019 Meta-Learning Multi-Armed Bandits
— Unverified 0 Non-Stochastic Multi-Player Multi-Armed Bandits: Optimal Rate With Collision Information, Sublinear Without Apr 28, 2019 Multi-Armed Bandits
— Unverified 0 Constrained Restless Bandits for Dynamic Scheduling in Cyber-Physical Systems Apr 18, 2019 Decision Making Decision Making Under Uncertainty
— Unverified 0 Introduction to Multi-Armed Bandits Apr 15, 2019 Multi-Armed Bandits
Code Available 0 Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication Apr 12, 2019 Multi-Armed Bandits
— Unverified 0 Collaborative Learning with Limited Interaction: Tight Bounds for Distributed Exploration in Multi-Armed Bandits Apr 5, 2019 Multi-Armed Bandits
— Unverified 0 Batched Multi-armed Bandits Problem Apr 3, 2019 Multi-Armed Bandits
Code Available 0 A Survey on Practical Applications of Multi-Armed and Contextual Bandits Apr 2, 2019 Information Retrieval Multi-Armed Bandits
— Unverified 0 Nearly Minimax-Optimal Regret for Linearly Parameterized Bandits Mar 30, 2019 Multi-Armed Bandits
— Unverified 0 Meta-Learning surrogate models for sequential decision making Mar 28, 2019 Bayesian Optimisation Decision Making
— Unverified 0 Contextual Bandits with Random Projection Mar 20, 2019 Multi-Armed Bandits
— Unverified 0 From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization Mar 7, 2019 compressed sensing Multi-Armed Bandits
Code Available 0 Perturbed-History Exploration in Stochastic Multi-Armed Bandits Feb 26, 2019 Multi-Armed Bandits
— Unverified 0 Better Algorithms for Stochastic Bandits with Adversarial Corruptions Feb 22, 2019 Multi-Armed Bandits
— Unverified 0 AdaLinUCB: Opportunistic Learning for Contextual Bandits Feb 20, 2019 Multi-Armed Bandits
— Unverified 0 Equal Opportunity in Online Classification with Partial Feedback Feb 6, 2019 Classification Decision Making Under Uncertainty
Code Available 0 Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting Feb 5, 2019 Multi-Armed Bandits
— Unverified 0 Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards Feb 3, 2019 Multi-Armed Bandits
— Unverified 0 A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free Feb 3, 2019 Multi-Armed Bandits
— Unverified 0 On the bias, risk and consistency of sample means in multi-armed bandits Feb 2, 2019 Multi-Armed Bandits Selection bias
— Unverified 0 Target Tracking for Contextual Bandits: Application to Demand Side Management Jan 28, 2019 Management Multi-Armed Bandits
— Unverified 0 Almost Boltzmann Exploration Jan 25, 2019 Multi-Armed Bandits Reinforcement Learning
— Unverified 0 Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching Jan 24, 2019 Decision Making Efficient Exploration
— Unverified 0 The Assistive Multi-Armed Bandit Jan 24, 2019 Multi-Armed Bandits
Code Available 0 PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits Jan 24, 2019 Multi-Armed Bandits
— Unverified 0 Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory Jan 24, 2019 Multi-Armed Bandits
— Unverified 0 Parallel Contextual Bandits in Wireless Handover Optimization Jan 21, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 0 Imitation-Regularized Offline Learning Jan 15, 2019 counterfactual Multi-Armed Bandits
— Unverified 0 Multiplayer Multi-armed Bandits for Optimal Assignment in Heterogeneous Networks Jan 12, 2019 Multi-Armed Bandits
Code Available 1 Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions Jan 4, 2019 Multi-Armed Bandits
— Unverified 0 Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback Jan 2, 2019 Multi-Armed Bandits
Code Available 0 Multi-player Multi-armed Bandits for Stable Allocation in Heterogeneous Ad-Hoc Networks Dec 24, 2018 channel selection Multi-Armed Bandits
— Unverified 0 Human-AI Learning Performance in Multi-Armed Bandits Dec 21, 2018 Decision Making Multi-Armed Bandits
— Unverified 0 Generalizable Meta-Heuristic based on Temporal Estimation of Rewards for Large Scale Blackbox Optimization Dec 17, 2018 Multi-Armed Bandits
— Unverified 0 Balanced Linear Contextual Bandits Dec 15, 2018 Causal Inference Multi-Armed Bandits
— Unverified 0