Randomised Bayesian Least-Squares Policy Iteration Apr 6, 2019 Thompson Sampling
— Unverified 0Sampling Acquisition Functions for Batch Bayesian Optimization Mar 22, 2019 Bayesian Optimization Thompson Sampling
— Unverified 0On Multi-Armed Bandit Designs for Dose-Finding Clinical Trials Mar 17, 2019 Thompson Sampling
— Unverified 0Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics Mar 11, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Meta Dynamic Pricing: Transfer Learning Across Experiments Feb 28, 2019 Thompson Sampling Transfer Learning
— Unverified 0Constrained Thompson Sampling for Wireless Link Optimization Feb 28, 2019 Thompson Sampling
— Unverified 0Fully Distributed Bayesian Optimization with Stochastic Policies Feb 26, 2019 Bayesian Optimization Thompson Sampling
— Unverified 0Multi-Armed Bandit Strategies for Non-Stationary Reward Distributions and Delayed Feedback Processes Feb 22, 2019 Thompson Sampling
— Unverified 0Scalable Thompson Sampling via Optimal Transport Feb 19, 2019 Decision Making Sequential Decision Making
— Unverified 0Thompson Sampling with Information Relaxation Penalties Feb 12, 2019 Thompson Sampling
Code Code Available 0KLUCB Approach to Copeland Bandits Feb 7, 2019 Information Retrieval Reinforcement Learning
— Unverified 0First-Order Bayesian Regret Analysis of Thompson Sampling Feb 2, 2019 Combinatorial Optimization Thompson Sampling
— Unverified 0Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model Jan 31, 2019 Recommendation Systems Thompson Sampling
— Unverified 0Thompson Sampling for a Fatigue-aware Online Recommendation System Jan 23, 2019 Thompson Sampling
Code Code Available 0Parallel Contextual Bandits in Wireless Handover Optimization Jan 21, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 0Information-Directed Exploration for Deep Reinforcement Learning Dec 18, 2018 Atari Games Deep Reinforcement Learning
Code Code Available 0MergeDTS: A Method for Effective Large-Scale Online Ranker Evaluation Dec 11, 2018 Information Retrieval Online Ranker Evaluation
Code Code Available 0Thompson Sampling for Noncompliant Bandits Dec 3, 2018 Thompson Sampling
— Unverified 0Bandit Learning with Implicit Feedback Dec 1, 2018 Bayesian Inference Thompson Sampling
Code Code Available 0Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks Nov 27, 2018 Thompson Sampling
— Unverified 0Adapting multi-armed bandits policies to contextual bandits scenarios Nov 11, 2018 Binary Classification Classification
Code Code Available 0Thompson Sampling for Pursuit-Evasion Problems Nov 11, 2018 Thompson Sampling
— Unverified 0Practical Bayesian Learning of Neural Networks via Adaptive Optimisation Methods Nov 8, 2018 Multi-Armed Bandits Thompson Sampling
Code Code Available 0A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting Oct 18, 2018 Thompson Sampling
— Unverified 0Combining Bayesian Optimization and Lipschitz Optimization Oct 10, 2018 Bayesian Optimization global-optimization
— Unverified 0Thompson Sampling Algorithms for Cascading Bandits Oct 2, 2018 Efficient Exploration Multi-Armed Bandits
— Unverified 0Contextual Multi-Armed Bandits for Causal Marketing Oct 2, 2018 Causal Inference counterfactual
— Unverified 0Efficient Linear Bandits through Matrix Sketching Sep 28, 2018 Thompson Sampling
— Unverified 0Incorporating Behavioral Constraints in Online AI Systems Sep 15, 2018 Thompson Sampling
— Unverified 0Analysis of Thompson Sampling for Combinatorial Multi-armed Bandit with Probabilistically Triggered Arms Sep 7, 2018 Thompson Sampling
— Unverified 0Adaptive Grey-Box Fuzz-Testing with Thompson Sampling Aug 24, 2018 Thompson Sampling
— Unverified 0Nonparametric Gaussian Mixture Models for the Multi-Armed Bandit Aug 8, 2018 Density Estimation Multi-Armed Bandits
Code Code Available 0Sequential Monte Carlo Bandits Aug 8, 2018 Decision Making Sequential Decision Making
Code Code Available 0Deep Contextual Multi-armed Bandits Jul 25, 2018 Marketing Multi-Armed Bandits
— Unverified 0Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits Jul 19, 2018 Multi-Armed Bandits Thompson Sampling
— Unverified 0Optimization of a SSP's Header Bidding Strategy using Thompson Sampling Jul 9, 2018 Thompson Sampling
— Unverified 0Improved Regret Bounds for Thompson Sampling in Linear Quadratic Control Problems Jul 1, 2018 Reinforcement Learning Thompson Sampling
— Unverified 0On The Differential Privacy of Thompson Sampling With Gaussian Prior Jun 24, 2018 Thompson Sampling
— Unverified 0Randomized Value Functions via Multiplicative Normalizing Flows Jun 6, 2018 Efficient Exploration Thompson Sampling
Code Code Available 0Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling Jun 4, 2018 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0An Information-Theoretic Analysis for Thompson Sampling with Many Actions May 30, 2018 Thompson Sampling
— Unverified 0Myopic Bayesian Design of Experiments via Posterior Sampling and Probabilistic Programming May 25, 2018 Bayesian Inference Multi-Armed Bandits
Code Code Available 0New Insights into Bootstrapping for Bandits May 24, 2018 Thompson Sampling
— Unverified 0Analysis of Thompson Sampling for Graphical Bandits Without the Graphs May 23, 2018 Thompson Sampling
— Unverified 0PG-TS: Improved Thompson Sampling for Logistic Contextual Bandits May 18, 2018 Multi-Armed Bandits Thompson Sampling
— Unverified 0Profitable Bandits May 8, 2018 Management Thompson Sampling
— Unverified 0Thompson Sampling for Combinatorial Semi-Bandits Mar 13, 2018 Thompson Sampling
— Unverified 0Active Reinforcement Learning with Monte-Carlo Tree Search Mar 13, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Satisficing in Time-Sensitive Bandit Learning Mar 7, 2018 Thompson Sampling
— Unverified 0Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling Feb 26, 2018 Decision Making Deep Reinforcement Learning
Code Code Available 0