Making Sense of Reinforcement Learning and Probabilistic Inference | Jan 3, 2020 | reinforcement-learning, Reinforcement Learning
Unverified | 0 | Randomized Exploration for Non-Stationary Stochastic Linear Bandits | Dec 11, 2019 | Computational Efficiency, Thompson Sampling
Code Available | 0 | Solving Bernoulli Rank-One Bandits with Unimodal Thompson Sampling | Dec 6, 2019 | Thompson Sampling
Unverified | 0 | Ordinal Bayesian Optimisation | Dec 5, 2019 | Bayesian Optimisation, Thompson Sampling
Unverified | 0 | Thompson Sampling and Approximate Inference | Dec 1, 2019 | Decision Making, Thompson Sampling
Unverified | 0 | Thompson Sampling for Multinomial Logit Contextual Bandits | Dec 1, 2019 | Multi-Armed Bandits, Thompson Sampling
Code Available | 0 | Bayesian Optimization for Categorical and Category-Specific Continuous Inputs | Nov 28, 2019 | Bayesian Optimization, BIG-bench Machine Learning
Code Available | 0 | Automatic Ensemble Learning for Online Influence Maximization | Nov 25, 2019 | Ensemble Learning, Multi-Armed Bandits
Unverified | 0 | Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures | Nov 22, 2019 | Thompson Sampling
Code Available | 0 | Information-Theoretic Confidence Bounds for Reinforcement Learning | Nov 21, 2019 | reinforcement-learning, Reinforcement Learning
Unverified | 0 | Adaptive Portfolio by Solving Multi-armed Bandit via Thompson Sampling | Nov 13, 2019 | Decision Making, Management
Unverified | 0 | Incentivized Exploration for Multi-Armed Bandits under Reward Drift | Nov 12, 2019 | Multi-Armed Bandits, Thompson Sampling
Unverified | 0 | Safe Linear Thompson Sampling with Side Information | Nov 6, 2019 | Thompson Sampling
Unverified | 0 | On Online Learning in Kernelized Markov Decision Processes | Nov 4, 2019 | Thompson Sampling
Unverified | 0 | On Batch Bayesian Optimization | Nov 4, 2019 | Bayesian Optimization, Thompson Sampling
Unverified | 0 | Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints | Nov 2, 2019 | Bayesian Optimization, Decision Making
Unverified | 0 | Thompson Sampling via Local Uncertainty | Oct 30, 2019 | Decision Making, Multi-Armed Bandits
Code Available | 0 | Fixed-Confidence Guarantees for Bayesian Best-Arm Identification | Oct 24, 2019 | Thompson Sampling
Unverified | 0 | Thompson Sampling in Non-Episodic Restless Bandits | Oct 12, 2019 | Open-Ended Question Answering, Thompson Sampling
Unverified | 0 | Regret Analysis of Bandit Problems with Causal Background Knowledge | Oct 11, 2019 | Thompson Sampling
Unverified | 0 | Old Dog Learns New Tricks: Randomized UCB for Bandit Problems | Oct 11, 2019 | Thompson Sampling
Code Available | 0 | Robust Dynamic Assortment Optimization in the Presence of Outlier Customers | Oct 9, 2019 | Assortment Optimization, Thompson Sampling
Unverified | 0 | A Quantile-based Approach for Hyperparameter Transfer Learning | Sep 30, 2019 | Bayesian Optimization, Hyperparameter Optimization
Unverified | 0 | A Copula approach for hyperparameter transfer learning | Sep 25, 2019 | Bayesian Optimization, Thompson Sampling
Unverified | 0 | Efficient Multivariate Bandit Algorithm with Path Planning | Sep 6, 2019 | Heuristic Search, Thompson Sampling
Unverified | 0 | An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits | Sep 5, 2019 | Decision Making, Recommendation Systems
Unverified | 0 | Online Causal Inference for Advertising in Real-Time Bidding Auctions | Aug 22, 2019 | Causal Inference, Experimental Design
Unverified | 0 | A Batched Multi-Armed Bandit Approach to News Headline Testing | Aug 17, 2019 | Articles, Thompson Sampling
Unverified | 0 | A Bayesian Choice Model for Eliminating Feedback Loops | Aug 15, 2019 | Recommendation Systems, Thompson Sampling
Unverified | 0 | Thompson Sampling with Approximate Inference | Aug 14, 2019 | Decision Making, Thompson Sampling
Unverified | 0 | Scaling Multi-Armed Bandit Algorithms | Jul 25, 2019 | Multi-Armed Bandits, Sequential Decision Making
Unverified | 0 | Convergence Rates of Posterior Distributions in Markov Decision Process | Jul 22, 2019 | Thompson Sampling
Unverified | 0 | Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning | Jul 11, 2019 | Thompson Sampling
Code Available | 0 | Thompson Sampling on Symmetric α-Stable Bandits | Jul 8, 2019 | Bayesian Inference, Decision Making
Unverified | 0 | Thompson Sampling for Combinatorial Network Optimization in Unknown Environments | Jul 7, 2019 | Combinatorial Optimization, Thompson Sampling
Unverified | 0 | Mixed-Variable Bayesian Optimization | Jul 2, 2019 | Bayesian Optimization, Thompson Sampling
Unverified | 0 | Bandit Learning for Diversified Interactive Recommendation | Jul 1, 2019 | Bayesian Inference, Diversity
Unverified | 0 | Thompson Sampling for Adversarial Bit Prediction | Jun 21, 2019 | Prediction, Thompson Sampling
Unverified | 0 | Revised Progressive-Hedging-Algorithm Based Two-layer Solution Scheme for Bayesian Reinforcement Learning | Jun 21, 2019 | Reinforcement Learning, Reinforcement Learning (RL)
Unverified | 0 | Sparse Spectrum Gaussian Process for Bayesian Optimization | Jun 21, 2019 | Bayesian Optimisation, Bayesian Optimization
Unverified | 0 | Stochastic Neural Network with Kronecker Flow | Jun 10, 2019 | Multi-Armed Bandits, Thompson Sampling
Unverified | 0 | The Intrinsic Robustness of Stochastic Bandits to Strategic Manipulation | Jun 4, 2019 | Recommendation Systems, Thompson Sampling
Unverified | 0 | Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems | May 29, 2019 | Multi-Armed Bandits, Thompson Sampling
Code Available | 0 | Connections Between Mirror Descent, Thompson Sampling and the Information Ratio | May 28, 2019 | Thompson Sampling
Unverified | 0 | Feedback graph regret bounds for Thompson Sampling and UCB | May 23, 2019 | Thompson Sampling
Unverified | 0 | Adaptive Model Selection Framework: An Application to Airline Pricing | May 21, 2019 | Model Selection, Thompson Sampling
Unverified | 0 | Adaptive Sensor Placement for Continuous Spaces | May 16, 2019 | Thompson Sampling
Unverified | 0 | On the Performance of Thompson Sampling on Logistic Bandits | May 12, 2019 | Thompson Sampling
Unverified | 0 | Memory Bounded Open-Loop Planning in Large POMDPs using Thompson Sampling | May 10, 2019 | Thompson Sampling
Code Available | 0 | AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning | Apr 8, 2019 | Bayesian Optimization, Inductive Bias
Unverified | 0