Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs Jun 24, 2022 Thompson Sampling
— Unverified 0Analysis of Thompson Sampling for Controlling Unknown Linear Diffusion Processes Jun 20, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0Thompson Sampling for (Combinatorial) Pure Exploration Jun 18, 2022 Thompson Sampling
— Unverified 0Thompson Sampling for Robust Transfer in Multi-Task Bandits Jun 17, 2022 Multi-Task Learning Thompson Sampling
Code Code Available 0Thompson Sampling Achieves O(T) Regret in Linear Quadratic Control Jun 17, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification Jun 16, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0On Provably Robust Meta-Bayesian Optimization Jun 14, 2022 Bayesian Optimization Meta-Learning
Code Code Available 0Top Two Algorithms Revisited Jun 13, 2022 Thompson Sampling Vocal Bursts Valence Prediction
— Unverified 0Regret Bounds for Information-Directed Reinforcement Learning Jun 9, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization Jun 5, 2022 BIG-bench Machine Learning Evolutionary Algorithms
— Unverified 0Incentivizing Combinatorial Bandit Exploration Jun 1, 2022 Thompson Sampling
— Unverified 0Mixed-Effect Thompson Sampling May 30, 2022 Thompson Sampling
Code Code Available 0Surrogate modeling for Bayesian optimization beyond a single Gaussian process May 27, 2022 Bayesian Optimization Drug Discovery
— Unverified 0Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits May 27, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Information-Directed Selection for Top-Two Algorithms May 24, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Fast Change Identification in Multi-Play Bandits and its Applications in Wireless Networks May 20, 2022 Change Detection Edge-computing
— Unverified 0Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization May 17, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Adjusted Expected Improvement for Cumulative Regret Minimization in Noisy Bayesian Optimization May 10, 2022 Bayesian Optimization Thompson Sampling
— Unverified 0Non-Stationary Bandit Learning via Predictive Sampling May 4, 2022 Attribute Thompson Sampling
— Unverified 0Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling Apr 26, 2022 Decision Making Evolutionary Algorithms
Code Code Available 0Thompson Sampling for Bandit Learning in Matching Markets Apr 26, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0On Kernelized Multi-Armed Bandits with Constraints Mar 29, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking Mar 24, 2022 Bayesian Optimization Decision Making
Code Code Available 0Thompson Sampling on Asymmetric α-Stable Bandits Mar 19, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Regenerative Particle Thompson Sampling Mar 15, 2022 Thompson Sampling
— Unverified 0Multi-Agent Active Search using Detection and Location Uncertainty Mar 9, 2022 Decision Making Disaster Response
— Unverified 0An Analysis of Ensemble Sampling Mar 2, 2022 Thompson Sampling
— Unverified 0Partial Likelihood Thompson Sampling Mar 2, 2022 Thompson Sampling
— Unverified 0Scalable Bayesian Optimization Using Vecchia Approximations of Gaussian Processes Mar 2, 2022 Bayesian Optimization Gaussian Processes
Code Code Available 0Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework Feb 26, 2022 Meta-Learning Thompson Sampling
— Unverified 0Thompson Sampling with Unrestricted Delays Feb 24, 2022 Thompson Sampling
— Unverified 0Double Thompson Sampling in Finite stochastic Games Feb 21, 2022 Thompson Sampling
— Unverified 0Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation Feb 18, 2022 Thompson Sampling
— Unverified 0Fast online inference for nonlinear contextual bandit based on Generative Adversarial Network Feb 17, 2022 Bayesian Inference Generative Adversarial Network
— Unverified 0Synthetically Controlled Bandits Feb 14, 2022 Thompson Sampling
— Unverified 0Remote Contextual Bandits Feb 10, 2022 Marketing Multi-Armed Bandits
— Unverified 0Fourier Representations for Black-Box Optimization over Categorical Variables Feb 8, 2022 regression Thompson Sampling
— Unverified 0On learning Whittle index policy for restless bandits with scalable regret Feb 7, 2022 Scheduling Thompson Sampling
— Unverified 0Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems Feb 7, 2022 Decision Making Dimensionality Reduction
Code Code Available 0Tsetlin Machine for Solving Contextual Bandit Problems Feb 4, 2022 Thompson Sampling
Code Code Available 0Deep Hierarchy in Bandits Feb 3, 2022 Thompson Sampling
— Unverified 0Evaluating Deep Vs. Wide & Deep Learners As Contextual Bandits For Personalized Email Promo Recommendations Jan 31, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework Jan 31, 2022 Bayesian Inference Multi-Armed Bandits
Code Code Available 0Modeling Human Exploration Through Resource-Rational Reinforcement Learning Jan 27, 2022 Meta-Learning reinforcement-learning
Code Code Available 0Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems Jan 25, 2022 parameter estimation Thompson Sampling
— Unverified 0IBAC: An Intelligent Dynamic Bandwidth Channel Access Avoiding Outside Warning Range Problem Jan 15, 2022 Thompson Sampling
— Unverified 0On Dynamic Pricing with Covariates Dec 25, 2021 Thompson Sampling
— Unverified 0Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization Dec 15, 2021 Thompson Sampling
— Unverified 0