Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models | Feb 16, 2021 | Decision Making, Meta Reinforcement Learning | Unverified | 0
Near-Optimal Algorithms for Differentially Private Online Learning in a Stochastic Environment | Feb 16, 2021 | Thompson Sampling | Unverified | 0
The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling | Feb 16, 2021 | Decision Making, LEMMA | Unverified | 0
Meta-Thompson Sampling | Feb 11, 2021 | Efficient Exploration, Meta-Learning | Unverified | 0
On the Suboptimality of Thompson Sampling in High Dimensions | Feb 10, 2021 | Thompson Sampling, Vocal Bursts Intensity Prediction | Code Available | 0
State-Aware Variational Thompson Sampling for Deep Q-Networks | Feb 7, 2021 | Thompson Sampling | Code Available | 0
Doubly robust Thompson sampling for linear payoffs | Feb 1, 2021 | Thompson Sampling | Unverified | 0
Weak Signal Asymptotics for Sequentially Randomized Experiments | Jan 25, 2021 | Thompson Sampling | Unverified | 0
An empirical evaluation of active inference in multi-armed bandits | Jan 21, 2021 | BIG-bench Machine Learning, Decision Making | Code Available | 1
Scalable Optimization for Wind Farm Control using Coordination Graphs | Jan 19, 2021 | Thompson Sampling | Code Available | 0
TSEC: a framework for online experimentation under experimental constraints | Jan 17, 2021 | Portfolio Optimization, Thompson Sampling | Unverified | 0
Deciding What to Learn: A Rate-Distortion Approach | Jan 15, 2021 | Decision Making, Sequential Decision Making | Unverified | 0
Etat de l'art sur l'application des bandits multi-bras | Jan 4, 2021 | Thompson Sampling | Unverified | 0
Meta-Reinforcement Learning With Informed Policy Regularization | Jan 1, 2021 | Meta Reinforcement Learning, reinforcement-learning | Unverified | 0
Learning to Play Imperfect-Information Games by Imitating an Oracle Planner | Dec 22, 2020 | Thompson Sampling | Code Available | 0
Aging Bandits: Regret Analysis and Order-Optimal Learning Algorithm for Wireless Networks with Stochastic Arrivals | Dec 16, 2020 | Thompson Sampling | Unverified | 0
Mercer Features for Efficient Combinatorial Bayesian Optimization | Dec 14, 2020 | Bayesian Optimization, Thompson Sampling | Code Available | 1
Reinforcement Learning with Subspaces using Free Energy Paradigm | Dec 13, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Optimal Thompson Sampling strategies for support-aware CVaR bandits | Dec 10, 2020 | Thompson Sampling | Code Available | 1
Distributed Thompson Sampling | Dec 3, 2020 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Non-Stationary Latent Bandits | Dec 1, 2020 | Recommendation Systems, Thompson Sampling | Unverified | 0
On Efficiency in Hierarchical Reinforcement Learning | Dec 1, 2020 | Computational Efficiency, Decision Making | Unverified | 0
Distilled Thompson Sampling: Practical and Efficient Thompson Sampling via Imitation Learning | Nov 29, 2020 | Action Generation, Decision Making | Unverified | 0
Reward Biased Maximum Likelihood Estimation for Reinforcement Learning | Nov 16, 2020 | Multi-Armed Bandits, reinforcement-learning | Unverified | 0
Risk-Constrained Thompson Sampling for CVaR Bandits | Nov 16, 2020 | Decision Making, Thompson Sampling | Unverified | 0
Accelerating Grasp Exploration by Leveraging Learned Priors | Nov 11, 2020 | Object, Thompson Sampling | Unverified | 0
Thompson sampling for linear quadratic mean-field teams | Nov 9, 2020 | Thompson Sampling | Unverified | 0
Multi-Agent Active Search using Realistic Depth-Aware Noise Model | Nov 9, 2020 | object-detection, Object Detection | Code Available | 0
Asymptotic Convergence of Thompson Sampling | Nov 8, 2020 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Adaptive Combinatorial Allocation | Nov 4, 2020 | Thompson Sampling | Unverified | 0
Multi-armed Bandits with Cost Subsidy | Nov 3, 2020 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Greedy k-Center from Noisy Distance Samples | Nov 3, 2020 | Thompson Sampling | Unverified | 0
Screening for an Infectious Disease as a Problem in Stochastic Control | Nov 1, 2020 | Thompson Sampling | Unverified | 0
Bandit Policies for Reliable Cellular Network Handovers in Extreme Mobility | Oct 28, 2020 | Thompson Sampling | Unverified | 0
Sub-sampling for Efficient Non-Parametric Bandit Exploration | Oct 27, 2020 | Thompson Sampling | Code Available | 0
Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration | Oct 23, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Bayesian Algorithms for Decentralized Stochastic Bandits | Oct 20, 2020 | Thompson Sampling | Code Available | 0
Federated Bayesian Optimization via Thompson Sampling | Oct 20, 2020 | Bayesian Optimization, Computational Efficiency | Code Available | 1
Reinforcement Learning for Efficient and Tuning-Free Link Adaptation | Oct 16, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Double-Linear Thompson Sampling for Context-Attentive Bandits | Oct 15, 2020 | Medical Diagnosis, Thompson Sampling | Unverified | 0
Asynchronous ε-Greedy Bayesian Optimisation | Oct 15, 2020 | Bayesian Optimisation, Thompson Sampling | Code Available | 0
Online Learning and Distributed Control for Residential Demand Response | Oct 11, 2020 | Stochastic Optimization, Thompson Sampling | Unverified | 0
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization | Oct 7, 2020 | Thompson Sampling | Unverified | 0
Neural Thompson Sampling | Oct 2, 2020 | Multi-Armed Bandits, Thompson Sampling | Code Available | 1
Stage-wise Conservative Linear Bandits | Sep 30, 2020 | Form, Thompson Sampling | Unverified | 0
Neural Model-based Optimization with Right-Censored Observations | Sep 29, 2020 | model, regression | Unverified | 0
Position-Based Multiple-Play Bandits with Thompson Sampling | Sep 28, 2020 | Position, Recommendation Systems | Unverified | 0
Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control | Sep 24, 2020 | Change Point Detection, Computational Efficiency | Unverified | 0
Partially Observable Online Change Detection via Smooth-Sparse Decomposition | Sep 22, 2020 | Bayesian Inference, Change Detection | Unverified | 0
Bandits Under The Influence (Extended Version) | Sep 21, 2020 | Recommendation Systems, Thompson Sampling | Unverified | 0