Variational Bayesian Optimistic Sampling | Oct 29, 2021 | Thompson Sampling | Unverified | 0
Differentially Private Federated Bayesian Optimization with Distributed Exploration | Oct 27, 2021 | Bayesian Optimization, Federated Learning | Unverified | 0
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision Making, Multi-Armed Bandits | Unverified | 0
Diversified Sampling for Batched Bayesian Optimization with Determinantal Point Processes | Oct 22, 2021 | Bayesian Optimization, Diversity | Unverified | 0
Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations | Oct 19, 2021 | Decision Making, Model Selection | Code Available | 0
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits | Oct 7, 2021 | Multi-Armed Bandits, Thompson Sampling | Code Available | 1
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning | Oct 2, 2021 | Multi-Armed Bandits, regression | Unverified | 0
Batched Thompson Sampling | Oct 1, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits | Oct 1, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Regularized-OFU: an efficient algorithm for general contextual bandit with optimization oracles | Sep 29, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Expected Improvement-based Contextual Bandits | Sep 29, 2021 | Bayesian Optimization, Multi-Armed Bandits | Unverified | 0
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification | Sep 29, 2021 | Binary Classification, Thompson Sampling | Unverified | 0
Deep Exploration for Recommendation Systems | Sep 26, 2021 | Recommendation Systems, Thompson Sampling | Unverified | 0
Vaccine allocation policy optimization and budget sharing mechanism using Thompson sampling | Sep 21, 2021 | Decision Making, Management | Code Available | 0
Online Learning of Network Bottlenecks via Minimax Paths | Sep 17, 2021 | Thompson Sampling | Unverified | 0
Machine Learning for Online Algorithm Selection under Censored Feedback | Sep 13, 2021 | BIG-bench Machine Learning, Thompson Sampling | Code Available | 0
Thompson Sampling for Bandits with Clustered Arms | Sep 6, 2021 | Clustering, Thompson Sampling | Unverified | 0
A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits | Aug 25, 2021 | Thompson Sampling | Code Available | 0
A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems | Aug 19, 2021 | Thompson Sampling | Unverified | 0
Scalable regret for learning to control network-coupled subsystems with unknown dynamics | Aug 18, 2021 | Thompson Sampling | Unverified | 0
Batched Thompson Sampling for Multi-Armed Bandits | Aug 15, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models | Aug 13, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Debiasing Samples from Online Learning Using Bootstrap | Jul 31, 2021 | Off-policy evaluation, Thompson Sampling | Unverified | 0
Adaptively Optimize Content Recommendation Using Multi Armed Bandit Algorithms in E-commerce | Jul 30, 2021 | Thompson Sampling | Unverified | 0
From Predictions to Decisions: The Importance of Joint Predictive Distributions | Jul 20, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
GuideBoot: Guided Bootstrap for Deep Contextual Bandits | Jul 18, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
No Regrets for Learning the Prior in Bandits | Jul 13, 2021 | Thompson Sampling | Unverified | 0
Metalearning Linear Bandits by Prior Update | Jul 12, 2021 | Decision Making, Sequential Decision Making | Unverified | 0
Bayesian decision-making under misspecified priors with applications to meta-learning | Jul 3, 2021 | Decision Making, Meta-Learning | Unverified | 0
Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow | Jul 1, 2021 | Decision Making, Marketing | Unverified | 0
Random Effect Bandits | Jun 23, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Thompson Sampling for Unimodal Bandits | Jun 15, 2021 | Thompson Sampling | Unverified | 0
Thompson Sampling with a Mixture Prior | Jun 10, 2021 | Decision Making, Multi-Task Learning | Unverified | 0
Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or Bayesian? | Jun 5, 2021 | Thompson Sampling | Unverified | 0
A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms | Jun 3, 2021 | Thompson Sampling | Unverified | 0
Parallelizing Thompson Sampling | Jun 2, 2021 | Decision Making, Thompson Sampling | Unverified | 0
Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits | May 30, 2021 | Edge-computing, Portfolio Optimization | Unverified | 0
Asymptotically Optimal Bandits under Weighted Information | May 28, 2021 | Thompson Sampling | Unverified | 0
Diffusion Approximations for Thompson Sampling | May 19, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Thompson Sampling for Gaussian Entropic Risk Bandits | May 14, 2021 | Decision Making, Thompson Sampling | Unverified | 0
Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks | May 10, 2021 | Efficient Exploration, Multi-Armed Bandits | Code Available | 1
Dynamic Slate Recommendation with Gated Recurrent Units and Thompson Sampling | Apr 30, 2021 | Recommendation Systems, Thompson Sampling | Code Available | 1
High-dimensional near-optimal experiment design for drug discovery via Bayesian sparse sampling | Apr 23, 2021 | Bayesian Inference, Drug Discovery | Unverified | 0
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution | Apr 14, 2021 | Bayesian Inference, Collaborative Filtering | Unverified | 0
Blind Exploration and Exploitation of Stochastic Experts | Apr 2, 2021 | Thompson Sampling | Unverified | 0
Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments | Mar 22, 2021 | Thompson Sampling | Unverified | 0
Constrained Contextual Bandit Learning for Adaptive Radar Waveform Selection | Mar 9, 2021 | Thompson Sampling | Unverified | 0
Efficient Optimal Selection for Composited Advertising Creatives with Tree Structure | Mar 2, 2021 | Efficient Exploration, Thompson Sampling | Code Available | 0
Automated Creative Optimization for E-Commerce Advertising | Feb 28, 2021 | AutoML, Click-Through Rate Prediction | Code Available | 0
Online Multi-Armed Bandits with Adaptive Inference | Feb 25, 2021 | Causal Inference, Decision Making | Unverified | 0