Unreliable Multi-Armed Bandits: A Novel Approach to Recommendation Systems Nov 14, 2019 Multi-Armed Bandits Recommendation Systems
— Unverified 0Triply Robust Off-Policy Evaluation Nov 13, 2019 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Incentivized Exploration for Multi-Armed Bandits under Reward Drift Nov 12, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 0Neural Contextual Bandits with UCB-based Exploration Nov 11, 2019 Efficient Exploration Multi-Armed Bandits
Code Code Available 0Confidence Intervals for Policy Evaluation in Adaptive Experiments Nov 7, 2019 Experimental Design Multi-Armed Bandits
Code Code Available 0Multi-Armed Bandits with Correlated Arms Nov 6, 2019 Multi-Armed Bandits
Code Code Available 0Persistency of Excitation for Robustness of Neural Networks Nov 4, 2019 Multi-Armed Bandits
Code Code Available 0Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs Nov 3, 2019 Multi-Armed Bandits reinforcement-learning
— Unverified 0Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints Nov 2, 2019 Bayesian Optimization Decision Making
— Unverified 0Thompson Sampling via Local Uncertainty Oct 30, 2019 Decision Making Multi-Armed Bandits
Code Code Available 0Trend-responsive User Segmentation Enabling Traceable Publishing Insights. A Case Study of a Real-world Large-scale News Recommendation System Oct 28, 2019 Diversity global-optimization
— Unverified 0BanditRank: Learning to Rank Using Contextual Bandits Oct 23, 2019 Information Retrieval Learning-To-Rank
— Unverified 0Smoothness-Adaptive Contextual Bandits Oct 22, 2019 Decision Making Multi-Armed Bandits
Code Code Available 0Multi-User MABs with User Dependent Rewards for Uncoordinated Spectrum Access Oct 21, 2019 Multi-Armed Bandits
— Unverified 0Decentralized Heterogeneous Multi-Player Multi-Armed Bandits with Non-Zero Rewards on Collisions Oct 21, 2019 Multi-Armed Bandits
— Unverified 0Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes Oct 15, 2019 Multi-Armed Bandits reinforcement-learning
Code Code Available 0Adaptive Exploration in Linear Contextual Bandit Oct 15, 2019 Decision Making Multi-Armed Bandits
— Unverified 0An Optimal Algorithm for Adversarial Bandits with Arbitrary Delays Oct 14, 2019 Multi-Armed Bandits
— Unverified 0Regret Bounds for Batched Bandits Oct 11, 2019 Multi-Armed Bandits
— Unverified 0Privacy-Preserving Multi-Party Contextual Bandits Oct 11, 2019 Multi-Armed Bandits Privacy Preserving
— Unverified 0Social Learning in Multi Agent Multi Armed Bandits Oct 4, 2019 Multi-Armed Bandits
— Unverified 0Decision Automation for Electric Power Network Recovery Oct 1, 2019 Decision Making Multi-Armed Bandits
— Unverified 0An Optimal Algorithm for Multiplayer Multi-Armed Bandits Sep 28, 2019 Multi-Armed Bandits
— Unverified 0NeuralUCB: Contextual Bandits with Neural Network-Based Exploration Sep 25, 2019 Efficient Exploration Multi-Armed Bandits
— Unverified 0Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching Sep 25, 2019 Efficient Exploration Multi-Armed Bandits
Code Code Available 0Learning Effective Exploration Strategies For Contextual Bandits Sep 25, 2019 Imitation Learning Learning-To-Rank
— Unverified 0Practical Calculation of Gittins Indices for Multi-armed Bandits Sep 11, 2019 Multi-Armed Bandits
Code Code Available 0AutoML for Contextual Bandits Sep 7, 2019 AutoML Feature Engineering
— Unverified 0Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes Sep 5, 2019 Multi-Armed Bandits
Code Code Available 0Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback Sep 4, 2019 Multi-Armed Bandits
Code Code Available 0A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits Aug 27, 2019 Change Detection Multi-Armed Bandits
— Unverified 0Nonparametric Contextual Bandits in an Unknown Metric Space Aug 3, 2019 Multi-Armed Bandits
— Unverified 0Doubly-Robust Lasso Bandit Jul 26, 2019 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Scaling Multi-Armed Bandit Algorithms Jul 25, 2019 Multi-Armed Bandits Sequential Decision Making
— Unverified 0Doubly robust off-policy evaluation with shrinkage Jul 22, 2019 Model Selection Multi-Armed Bandits
— Unverified 0Parameterized Exploration Jul 13, 2019 Multi-Armed Bandits
— Unverified 0Productization Challenges of Contextual Multi-Armed Bandits Jul 10, 2019 Multi-Armed Bandits
— Unverified 0Individual Regret in Cooperative Nonstochastic Multi-Armed Bandits Jul 7, 2019 Multi-Armed Bandits
— Unverified 0Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits Jul 2, 2019 Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits with Fairness Constraints for Distributing Resources to Human Teammates Jun 30, 2019 Fairness Multi-Armed Bandits
— Unverified 0Bayesian Optimisation over Multiple Continuous and Categorical Inputs Jun 20, 2019 Bayesian Optimisation Diversity
Code Code Available 0Learning in Restless Multi-Armed Bandits via Adaptive Arm Sequencing Rules Jun 19, 2019 Multi-Armed Bandits
— Unverified 0Online Allocation and Pricing: Constant Regret via Bellman Inequalities Jun 14, 2019 Multi-Armed Bandits
— Unverified 0Competing Bandits in Matching Markets Jun 12, 2019 Multi-Armed Bandits
— Unverified 0Bootstrapping Upper Confidence Bound Jun 12, 2019 Decision Making Multi-Armed Bandits
— Unverified 0Beam Learning -- Using Machine Learning for Finding Beam Directions Jun 11, 2019 BIG-bench Machine Learning Multi-Armed Bandits
— Unverified 0Stochastic Neural Network with Kronecker Flow Jun 10, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 0Balanced off-policy evaluation in general action spaces Jun 9, 2019 Binary Classification counterfactual
— Unverified 0Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning Jun 9, 2019 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Empirical Likelihood for Contextual Bandits Jun 7, 2019 Multi-Armed Bandits
Code Code Available 0