Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis Jan 21, 2024 Thompson Sampling
— Unverified 0Model-Free Approximate Bayesian Learning for Large-Scale Conversion Funnel Optimization Jan 12, 2024 Decision Making Marketing
— Unverified 0Decentralized Multi-Agent Active Search and Tracking when Targets Outnumber Agents Jan 6, 2024 Decision Making Thompson Sampling
— Unverified 0Improving sample efficiency of high dimensional Bayesian optimization with MCMC Jan 5, 2024 Bayesian Optimization Thompson Sampling
— Unverified 0Zero-Inflated Bandits Dec 25, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs Dec 24, 2023 Computational Efficiency Thompson Sampling
Code Code Available 0Best Arm Identification in Batched Multi-armed Bandit Problems Dec 21, 2023 Marketing Thompson Sampling
— Unverified 0Bayesian Analysis of Combinatorial Gaussian Process Bandits Dec 20, 2023 Bayesian Inference Informativeness
— Unverified 0RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Sample-based Dynamic Hierarchical Transformer with Layer and Head Flexibility via Contextual Bandit Dec 5, 2023 Thompson Sampling
— Unverified 0The Sliding Regret in Stochastic Bandits: Discriminating Index and Randomized Policies Nov 30, 2023 Thompson Sampling
— Unverified 0Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study Nov 24, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Probabilistic Inference in Reinforcement Learning Done Right Nov 22, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0A Distributed Neural Linear Thompson Sampling Framework to Achieve URLLC in Industrial IoT Nov 21, 2023 Scheduling Thompson Sampling
— Unverified 0Adaptive Interventions with User-Defined Goals for Health Behavior Change Nov 16, 2023 Thompson Sampling
Code Code Available 0Exploration via linearly perturbed loss minimisation Nov 13, 2023 Thompson Sampling
— Unverified 0Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds Nov 7, 2023 Bayesian Optimization Thompson Sampling
— Unverified 0Batch Bayesian Optimization for Replicable Experimental Design Nov 2, 2023 AutoML Bayesian Optimization
— Unverified 0Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning Oct 30, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Dual-Directed Algorithm Design for Efficient Pure Exploration Oct 30, 2023 Thompson Sampling
— Unverified 0Little Exploration is All You Need Oct 26, 2023 All Thompson Sampling
— Unverified 0Making RL with Preference-based Feedback Efficient via Randomization Oct 23, 2023 Active Learning Thompson Sampling
— Unverified 0Parallel Bayesian Optimization Using Satisficing Thompson Sampling for Time-Sensitive Black-Box Optimization Oct 19, 2023 Bayesian Optimization STS
— Unverified 0Using Adaptive Bandit Experiments to Increase and Investigate Engagement in Mental Health Oct 13, 2023 Thompson Sampling
Code Code Available 0Optimal Exploration is no harder than Thompson Sampling Oct 9, 2023 Thompson Sampling
— Unverified 0Module-wise Adaptive Distillation for Multimodality Foundation Models Oct 6, 2023 Image Captioning Thompson Sampling
— Unverified 0From Bandits Model to Deep Deterministic Policy Gradient, Reinforcement Learning with Contextual Information Oct 1, 2023 Decision Making reinforcement-learning
— Unverified 0Thompson Exploration with Best Challenger Rule in Best Arm Identification Oct 1, 2023 Thompson Sampling
— Unverified 0Monte-Carlo tree search with uncertainty propagation via optimal transport Sep 19, 2023 Thompson Sampling
— Unverified 0Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits Sep 18, 2023 Dialogue Act Classification Multi-Armed Bandits
— Unverified 0gym-saturation: Gymnasium environments for saturation provers (System description) Sep 16, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Generalized Regret Analysis of Thompson Sampling using Fractional Posteriors Sep 12, 2023 Thompson Sampling
— Unverified 0Simple Modification of the Upper Confidence Bound Algorithm by Generalized Weighted Averages Aug 28, 2023 Decision Making Decision Making Under Uncertainty
Code Code Available 0Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Aug 21, 2023 Decision Making Multi-Armed Bandits
Code Code Available 0Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit Aug 20, 2023 Thompson Sampling
— Unverified 0AdaptEx: A Self-Service Contextual Bandit Platform Aug 8, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Bag of Policies for Distributional Deep Exploration Aug 3, 2023 Atari Games Efficient Exploration
— Unverified 0VITS : Variational Inference Thompson Sampling for contextual bandits Jul 19, 2023 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Approximate information for efficient exploration-exploitation strategies Jul 4, 2023 Decision Making Efficient Exploration
— Unverified 0Thompson Sampling under Bernoulli Rewards with Local Differential Privacy Jul 3, 2023 Thompson Sampling
— Unverified 0Thompson sampling for improved exploration in GFlowNets Jun 30, 2023 Active Learning Decision Making
— Unverified 0Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Jun 26, 2023 Decision Making Thompson Sampling
— Unverified 0Scalable Neural Contextual Bandit for Recommender Systems Jun 26, 2023 Recommendation Systems Thompson Sampling
— Unverified 0Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning Jun 15, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space Jun 5, 2023 Thompson Sampling
— Unverified 0Incentivizing Exploration with Linear Contexts and Combinatorial Actions Jun 3, 2023 Thompson Sampling
— Unverified 0ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages Jun 2, 2023 Bayesian Inference continuous-control
Code Code Available 0Combinatorial Neural Bandits May 31, 2023 Thompson Sampling
— Unverified 0Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic Experimentation May 24, 2023 Thompson Sampling
— Unverified 0Discounted Thompson Sampling for Non-Stationary Bandit Problems May 18, 2023 Thompson Sampling
— Unverified 0