Position-Based Multiple-Play Bandits with Thompson Sampling Sep 28, 2020 Position Recommendation Systems
— Unverified 0Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control Sep 24, 2020 Change Point Detection Computational Efficiency
— Unverified 0Partially Observable Online Change Detection via Smooth-Sparse Decomposition Sep 22, 2020 Bayesian Inference Change Detection
— Unverified 0Bandits Under The Influence (Extended Version) Sep 21, 2020 Recommendation Systems Thompson Sampling
— Unverified 0Causal Bandits without prior knowledge using separating sets Sep 16, 2020 Causal Discovery Decision Making
— Unverified 0Thompson Sampling for Unsupervised Sequential Selection Sep 16, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Change-Detection Based Thompson Sampling Framework for Non-Stationary Bandits Sep 6, 2020 Change Detection Thompson Sampling
— Unverified 0Efficient Online Learning for Cognitive Radar-Cellular Coexistence via Contextual Thompson Sampling Aug 24, 2020 Deep Reinforcement Learning Thompson Sampling
— Unverified 0Contextual Bandits for Advertising Budget Allocation Aug 22, 2020 Marketing Multi-Armed Bandits
— Unverified 0Near Optimal Adversarial Attacks on Stochastic Bandits and Defenses with Smoothed Responses Aug 21, 2020 Adversarial Attack Thompson Sampling
— Unverified 0Reinforcement Learning with Trajectory Feedback Aug 13, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Lenient Regret for Multi-Armed Bandits Aug 10, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0IntelligentPooling: Practical Thompson Sampling for mHealth Jul 31, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Greedy Bandits with Sampled Context Jul 27, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems Jul 9, 2020 Thompson Sampling
— Unverified 0Variable Selection via Thompson Sampling Jul 1, 2020 BIG-bench Machine Learning Interpretable Machine Learning
— Unverified 0Policy Gradient Optimization of Thompson Sampling Policies Jun 30, 2020 Policy Gradient Methods Thompson Sampling
— Unverified 0Asynchronous Multi Agent Active Search Jun 25, 2020 Bayesian Optimization Compressive Sensing
— Unverified 0Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect Jun 18, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Constrained Thompson Sampling for Real-Time Electricity Pricing with Grid Reliability Constraints Jun 17, 2020 Thompson Sampling
— Unverified 0Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring Jun 17, 2020 Decision Making Thompson Sampling
— Unverified 0Latent Bandits Revisited Jun 15, 2020 Recommendation Systems Thompson Sampling
— Unverified 0Hypermodels for Exploration Jun 12, 2020 Thompson Sampling
— Unverified 0TS-UCB: Improving on Thompson Sampling With Little to No Additional Computation Jun 11, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0On Frequentist Regret of Linear Thompson Sampling Jun 11, 2020 Thompson Sampling
— Unverified 0Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits Jun 11, 2020 Thompson Sampling
— Unverified 0Scalable Thompson Sampling using Sparse Gaussian Process Models Jun 9, 2020 Thompson Sampling
— Unverified 0Random Hypervolume Scalarizations for Provable Multi-Objective Black Box Optimization Jun 8, 2020 Bayesian Optimization Thompson Sampling
— Unverified 0An Efficient Algorithm For Generalized Linear Bandit: Online Stochastic Gradient Descent and Thompson Sampling Jun 7, 2020 Thompson Sampling
— Unverified 0Concurrent Decentralized Channel Allocation and Access Point Selection using Multi-Armed Bandits in multi BSS WLANs Jun 5, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Thompson Sampling for Combinatorial Semi-bandits with Sleeping Arms and Long-Term Fairness Constraints May 14, 2020 Fairness Movie Recommendation
— Unverified 0Learning to Rank in the Position Based Model with Bandit Feedback Apr 27, 2020 Learning-To-Rank Multi-Armed Bandits
— Unverified 0Online Learning with Cumulative Oversampling: Application to Budgeted Influence Maximization Apr 24, 2020 Thompson Sampling
— Unverified 0Adaptive Operator Selection Based on Dynamic Thompson Sampling for MOEA/D Apr 22, 2020 Thompson Sampling
— Unverified 0Thompson Sampling for Linearly Constrained Bandits Apr 20, 2020 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Optimal No-regret Learning in Repeated First-price Auctions Mar 22, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Reliability-aware Multi-armed Bandit Approach to Learn and Select Users in Demand Response Mar 20, 2020 Avg Thompson Sampling
— Unverified 0Delay-Adaptive Learning in Generalized Linear Contextual Bandits Mar 11, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Online Residential Demand Response via Contextual Multi-Armed Bandits Mar 7, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Odds-Ratio Thompson Sampling to Control for Time-Varying Effect Mar 4, 2020 Thompson Sampling
Code Code Available 0An Online Learning Framework for Energy-Efficient Navigation of Electric Vehicles Mar 3, 2020 Navigate Thompson Sampling
— Unverified 0MOTS: Minimax Optimal Thompson Sampling Mar 3, 2020 Thompson Sampling
— Unverified 0Efficient exploration of zero-sum stochastic games Feb 24, 2020 Efficient Exploration Thompson Sampling
— Unverified 0On Thompson Sampling with Langevin Algorithms Feb 23, 2020 Thompson Sampling
— Unverified 0Residual Bootstrap Exploration for Bandit Algorithms Feb 19, 2020 Computational Efficiency Multi-Armed Bandits
— Unverified 0A General Theory of the Stochastic Linear Bandit and Its Applications Feb 12, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0The Price of Incentivizing Exploration: A Characterization via Thompson Sampling and Sample Complexity Feb 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Thompson Sampling Algorithms for Mean-Variance Bandits Feb 1, 2020 Decision Making Thompson Sampling
Code Code Available 0Bayesian Quantile and Expectile Optimisation Jan 12, 2020 Bayesian Optimisation Gaussian Processes
— Unverified 0On Thompson Sampling for Smoother-than-Lipschitz Bandits Jan 8, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0