Convex Hull Monte-Carlo Tree Search Mar 9, 2020 Multi-Armed Bandits
— Unverified 0Online Residential Demand Response via Contextual Multi-Armed Bandits Mar 7, 2020 Decision Making Multi-Armed Bandits
— Unverified 0A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option Mar 6, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Generalized Policy Elimination: an efficient algorithm for Nonparametric Contextual Bandits Mar 5, 2020 Multi-Armed Bandits
— Unverified 0Stochastic Linear Contextual Bandits with Diverse Contexts Mar 5, 2020 Diversity Multi-Armed Bandits
— Unverified 0Robustness Guarantees for Mode Estimation with an Application to Bandits Mar 5, 2020 Multi-Armed Bandits
— Unverified 0Taking a hint: How to leverage loss predictors in contextual bandits? Mar 4, 2020 Multi-Armed Bandits
— Unverified 0Distributed Cooperative Decision Making in Multi-agent Multi-armed Bandits Mar 3, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Model Selection in Contextual Stochastic Bandit Problems Mar 3, 2020 model Model Selection
— Unverified 0Bounded Regret for Finitely Parameterized Multi-Armed Bandits Mar 3, 2020 Multi-Armed Bandits
— Unverified 0Decentralized Multi-player Multi-armed Bandits with No Collision Information Feb 29, 2020 Multi-Armed Bandits
— Unverified 0Designing Truthful Contextual Multi-Armed Bandits based Sponsored Search Auctions Feb 26, 2020 Multi-Armed Bandits
— Unverified 0Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis Feb 26, 2020 Multi-Armed Bandits
— Unverified 0Bandit Learning with Delayed Impact of Actions Feb 24, 2020 Fairness Multi-Armed Bandits
— Unverified 0The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms Feb 24, 2020 Multi-Armed Bandits
Code Code Available 0Survey Bandits with Regret Guarantees Feb 23, 2020 Multi-Armed Bandits Survey
— Unverified 0Online Learning in Contextual Bandits using Gated Linear Networks Feb 21, 2020 Multi-Armed Bandits
— Unverified 0Residual Bootstrap Exploration for Bandit Algorithms Feb 19, 2020 Computational Efficiency Multi-Armed Bandits
— Unverified 0On conditional versus marginal bias in multi-armed bandits Feb 19, 2020 Multi-Armed Bandits
— Unverified 0Adaptive Estimator Selection for Off-Policy Evaluation Feb 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Coordination without communication: optimal regret in two players multi-armed bandits Feb 14, 2020 Multi-Armed Bandits Vocal Bursts Valence Prediction
— Unverified 0Tight Lower Bounds for Combinatorial Multi-Armed Bandits Feb 13, 2020 Decision Making Multi-Armed Bandits
— Unverified 0A General Theory of the Stochastic Linear Bandit and Its Applications Feb 12, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles Feb 12, 2020 Multi-Armed Bandits regression
— Unverified 0Adversarial Attacks on Linear Contextual Bandits Feb 10, 2020 Multi-Armed Bandits Recommendation Systems
— Unverified 0Inference for Batched Bandits Feb 8, 2020 Multi-Armed Bandits
— Unverified 0Selfish Robustness and Equilibria in Multi-Player Bandits Feb 4, 2020 Multi-Armed Bandits
— Unverified 0The Price of Incentivizing Exploration: A Characterization via Thompson Sampling and Sample Complexity Feb 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Safe Exploration for Optimizing Contextual Bandits Feb 2, 2020 counterfactual Information Retrieval
Code Code Available 0A Closer Look at Small-loss Bounds for Bandits with Graph Feedback Feb 2, 2020 Multi-Armed Bandits
— Unverified 0Efficient and Robust Algorithms for Adversarial Linear Contextual Bandits Feb 1, 2020 Multi-Armed Bandits
— Unverified 0Bandits with Knapsacks beyond the Worst-Case Feb 1, 2020 Multi-Armed Bandits
— Unverified 0Ballooning Multi-Armed Bandits Jan 24, 2020 Multi-Armed Bandits
— Unverified 0Incentivising Exploration and Recommendations for Contextual Bandits with Payments Jan 22, 2020 Multi-Armed Bandits
— Unverified 0Exploration Through Bias: Revisiting Biased Maximum Likelihood Estimation in Stochastic Multi-Armed Bandits Jan 1, 2020 Multi-Armed Bandits
— Unverified 0Gradient-free Online Learning in Continuous Games with Delayed Rewards Jan 1, 2020 Multi-Armed Bandits Recommendation Systems
— Unverified 0Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits Jan 1, 2020 Multi-Armed Bandits
— Unverified 0A Modern Introduction to Online Learning Dec 31, 2019 All Multi-Armed Bandits
Code Code Available 1Fair Contextual Multi-Armed Bandits: Theory and Experiments Dec 13, 2019 Decision Making Fairness
— Unverified 0Sublinear Optimal Policy Value Estimation in Contextual Bandits Dec 12, 2019 Multi-Armed Bandits
— Unverified 0Surrogate Objectives for Batch Policy Optimization in One-step Decision Making Dec 1, 2019 Decision Making Multi-Armed Bandits
— Unverified 0Offline Contextual Bandits with High Probability Fairness Guarantees Dec 1, 2019 Fairness Multi-Armed Bandits
Code Code Available 0Learning in Generalized Linear Contextual Bandits with Stochastic Delays Dec 1, 2019 Multi-Armed Bandits
— Unverified 0Nonparametric Contextual Bandits in Metric Spaces with Unknown Metric Dec 1, 2019 Multi-Armed Bandits
— Unverified 0Epsilon-Best-Arm Identification in Pay-Per-Reward Multi-Armed Bandits Dec 1, 2019 Multi-Armed Bandits
— Unverified 0Thompson Sampling for Multinomial Logit Contextual Bandits Dec 1, 2019 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Contextual Combinatorial Conservative Bandits Nov 26, 2019 Multi-Armed Bandits
— Unverified 0Automatic Ensemble Learning for Online Influence Maximization Nov 25, 2019 Ensemble Learning Multi-Armed Bandits
— Unverified 0Corruption-robust exploration in episodic reinforcement learning Nov 20, 2019 Multi-Armed Bandits reinforcement-learning
— Unverified 0Contextual Bandits Evolving Over Finite Time Nov 14, 2019 Multi-Armed Bandits
— Unverified 0