Thompson Sampling for Gaussian Entropic Risk Bandits May 14, 2021 Decision Making Thompson Sampling
— Unverified 0High-dimensional near-optimal experiment design for drug discovery via Bayesian sparse sampling Apr 23, 2021 Bayesian Inference Drug Discovery
— Unverified 0When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution Apr 14, 2021 Bayesian Inference Collaborative Filtering
— Unverified 0Blind Exploration and Exploitation of Stochastic Experts Apr 2, 2021 Thompson Sampling
— Unverified 0Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments Mar 22, 2021 Thompson Sampling
— Unverified 0Constrained Contextual Bandit Learning for Adaptive Radar Waveform Selection Mar 9, 2021 Thompson Sampling
— Unverified 0Efficient Optimal Selection for Composited Advertising Creatives with Tree Structure Mar 2, 2021 Efficient Exploration Thompson Sampling
Code Code Available 0Automated Creative Optimization for E-Commerce Advertising Feb 28, 2021 AutoML Click-Through Rate Prediction
Code Code Available 0Online Multi-Armed Bandits with Adaptive Inference Feb 25, 2021 Causal Inference Decision Making
— Unverified 0Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models Feb 16, 2021 Decision Making Meta Reinforcement Learning
— Unverified 0Near-Optimal Algorithms for Differentially Private Online Learning in a Stochastic Environment Feb 16, 2021 Thompson Sampling
— Unverified 0The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling Feb 16, 2021 Decision Making LEMMA
— Unverified 0Meta-Thompson Sampling Feb 11, 2021 Efficient Exploration Meta-Learning
— Unverified 0On the Suboptimality of Thompson Sampling in High Dimensions Feb 10, 2021 Thompson Sampling Vocal Bursts Intensity Prediction
Code Code Available 0State-Aware Variational Thompson Sampling for Deep Q-Networks Feb 7, 2021 Thompson Sampling
Code Code Available 0Doubly robust Thompson sampling for linear payoffs Feb 1, 2021 Thompson Sampling
— Unverified 0Weak Signal Asymptotics for Sequentially Randomized Experiments Jan 25, 2021 Thompson Sampling
— Unverified 0Scalable Optimization for Wind Farm Control using Coordination Graphs Jan 19, 2021 Thompson Sampling
Code Code Available 0TSEC: a framework for online experimentation under experimental constraints Jan 17, 2021 Portfolio Optimization Thompson Sampling
— Unverified 0Deciding What to Learn: A Rate-Distortion Approach Jan 15, 2021 Decision Making Sequential Decision Making
— Unverified 0Etat de l'art sur l'application des bandits multi-bras Jan 4, 2021 Thompson Sampling
— Unverified 0Meta-Reinforcement Learning With Informed Policy Regularization Jan 1, 2021 Meta Reinforcement Learning reinforcement-learning
— Unverified 0Learning to Play Imperfect-Information Games by Imitating an Oracle Planner Dec 22, 2020 Thompson Sampling
Code Code Available 0Aging Bandits: Regret Analysis and Order-Optimal Learning Algorithm for Wireless Networks with Stochastic Arrivals Dec 16, 2020 Thompson Sampling
— Unverified 0Reinforcement Learning with Subspaces using Free Energy Paradigm Dec 13, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Distributed Thompson Sampling Dec 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0On Efficiency in Hierarchical Reinforcement Learning Dec 1, 2020 Computational Efficiency Decision Making
— Unverified 0Non-Stationary Latent Bandits Dec 1, 2020 Recommendation Systems Thompson Sampling
— Unverified 0Distilled Thompson Sampling: Practical and Efficient Thompson Sampling via Imitation Learning Nov 29, 2020 Action Generation Decision Making
— Unverified 0Risk-Constrained Thompson Sampling for CVaR Bandits Nov 16, 2020 Decision Making Thompson Sampling
— Unverified 0Reward Biased Maximum Likelihood Estimation for Reinforcement Learning Nov 16, 2020 Multi-Armed Bandits reinforcement-learning
— Unverified 0Accelerating Grasp Exploration by Leveraging Learned Priors Nov 11, 2020 Object Thompson Sampling
— Unverified 0Multi-Agent Active Search using Realistic Depth-Aware Noise Model Nov 9, 2020 object-detection Object Detection
Code Code Available 0Thompson sampling for linear quadratic mean-field teams Nov 9, 2020 Thompson Sampling
— Unverified 0Asymptotic Convergence of Thompson Sampling Nov 8, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Adaptive Combinatorial Allocation Nov 4, 2020 Thompson Sampling
— Unverified 0Greedy k-Center from Noisy Distance Samples Nov 3, 2020 Thompson Sampling
— Unverified 0Multi-armed Bandits with Cost Subsidy Nov 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Screening for an Infectious Disease as a Problem in Stochastic Control Nov 1, 2020 Thompson Sampling
— Unverified 0Bandit Policies for Reliable Cellular Network Handovers in Extreme Mobility Oct 28, 2020 Thompson Sampling
— Unverified 0Sub-sampling for Efficient Non-Parametric Bandit Exploration Oct 27, 2020 Thompson Sampling
Code Code Available 0Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration Oct 23, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Bayesian Algorithms for Decentralized Stochastic Bandits Oct 20, 2020 Thompson Sampling
Code Code Available 0Reinforcement Learning for Efficient and Tuning-Free Link Adaptation Oct 16, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Double-Linear Thompson Sampling for Context-Attentive Bandits Oct 15, 2020 Medical Diagnosis Thompson Sampling
— Unverified 0Asynchronous ε-Greedy Bayesian Optimisation Oct 15, 2020 Bayesian Optimisation Thompson Sampling
Code Code Available 0Online Learning and Distributed Control for Residential Demand Response Oct 11, 2020 Stochastic Optimization Thompson Sampling
— Unverified 0Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization Oct 7, 2020 Thompson Sampling
— Unverified 0Stage-wise Conservative Linear Bandits Sep 30, 2020 Form Thompson Sampling
— Unverified 0Neural Model-based Optimization with Right-Censored Observations Sep 29, 2020 model regression
— Unverified 0