Little Exploration is All You Need Oct 26, 2023 All Thompson Sampling
— Unverified 0qPOTS: Efficient batch multiobjective Bayesian optimization via Pareto optimal Thompson sampling Oct 24, 2023 Bayesian Optimization Computational Efficiency
Code Code Available 1Making RL with Preference-based Feedback Efficient via Randomization Oct 23, 2023 Active Learning Thompson Sampling
— Unverified 0Parallel Bayesian Optimization Using Satisficing Thompson Sampling for Time-Sensitive Black-Box Optimization Oct 19, 2023 Bayesian Optimization STS
— Unverified 0Using Adaptive Bandit Experiments to Increase and Investigate Engagement in Mental Health Oct 13, 2023 Thompson Sampling
Code Code Available 0Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining Oct 12, 2023 In-Context Reinforcement Learning reinforcement-learning
Code Code Available 1Optimal Exploration is no harder than Thompson Sampling Oct 9, 2023 Thompson Sampling
— Unverified 0Module-wise Adaptive Distillation for Multimodality Foundation Models Oct 6, 2023 Image Captioning Thompson Sampling
— Unverified 0Thompson Exploration with Best Challenger Rule in Best Arm Identification Oct 1, 2023 Thompson Sampling
— Unverified 0From Bandits Model to Deep Deterministic Policy Gradient, Reinforcement Learning with Contextual Information Oct 1, 2023 Decision Making reinforcement-learning
— Unverified 0Monte-Carlo tree search with uncertainty propagation via optimal transport Sep 19, 2023 Thompson Sampling
— Unverified 0Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits Sep 18, 2023 Dialogue Act Classification Multi-Armed Bandits
— Unverified 0gym-saturation: Gymnasium environments for saturation provers (System description) Sep 16, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Generalized Regret Analysis of Thompson Sampling using Fractional Posteriors Sep 12, 2023 Thompson Sampling
— Unverified 0Simple Modification of the Upper Confidence Bound Algorithm by Generalized Weighted Averages Aug 28, 2023 Decision Making Decision Making Under Uncertainty
Code Code Available 0Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Aug 21, 2023 Decision Making Multi-Armed Bandits
Code Code Available 0Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit Aug 20, 2023 Thompson Sampling
— Unverified 0AdaptEx: A Self-Service Contextual Bandit Platform Aug 8, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Bag of Policies for Distributional Deep Exploration Aug 3, 2023 Atari Games Efficient Exploration
— Unverified 0VITS : Variational Inference Thompson Sampling for contextual bandits Jul 19, 2023 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Approximate information for efficient exploration-exploitation strategies Jul 4, 2023 Decision Making Efficient Exploration
— Unverified 0Thompson Sampling under Bernoulli Rewards with Local Differential Privacy Jul 3, 2023 Thompson Sampling
— Unverified 0Thompson sampling for improved exploration in GFlowNets Jun 30, 2023 Active Learning Decision Making
— Unverified 0Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Jun 26, 2023 Decision Making Thompson Sampling
— Unverified 0Scalable Neural Contextual Bandit for Recommender Systems Jun 26, 2023 Recommendation Systems Thompson Sampling
— Unverified 0Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning Jun 15, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space Jun 5, 2023 Thompson Sampling
— Unverified 0Incentivizing Exploration with Linear Contexts and Combinatorial Actions Jun 3, 2023 Thompson Sampling
— Unverified 0ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages Jun 2, 2023 Bayesian Inference continuous-control
Code Code Available 0Combinatorial Neural Bandits May 31, 2023 Thompson Sampling
— Unverified 0Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo May 29, 2023 Efficient Exploration reinforcement-learning
Code Code Available 1Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic Experimentation May 24, 2023 Thompson Sampling
— Unverified 0Discounted Thompson Sampling for Non-Stationary Bandit Problems May 18, 2023 Thompson Sampling
— Unverified 0Sequential Best-Arm Identification with Application to Brain-Computer Interface May 17, 2023 Brain Computer Interface EEG
— Unverified 0Thompson Sampling for Parameterized Markov Decision Processes with Uninformative Actions May 13, 2023 Bayesian Inference Thompson Sampling
— Unverified 0An improved regret analysis for UCB-N and TS-N May 6, 2023 LEMMA Thompson Sampling
— Unverified 0Trajectory-oriented optimization of stochastic epidemiological models May 6, 2023 Thompson Sampling
Code Code Available 0Neural Exploitation and Exploration of Contextual Bandits May 5, 2023 Multi-Armed Bandits Thompson Sampling
Code Code Available 1Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards Apr 28, 2023 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards Apr 26, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Efficiently Tackling Million-Dimensional Multiobjective Problems: A Direction Sampling and Fine-Tuning Approach Apr 8, 2023 Multiobjective Optimization Recommendation Systems
— Unverified 0Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms Apr 6, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0GUTS: Generalized Uncertainty-Aware Thompson Sampling for Multi-Agent Active Search Apr 4, 2023 All Disaster Response
— Unverified 0Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches Mar 21, 2023 Benchmarking Thompson Sampling
— Unverified 0Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling Mar 16, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning Mar 10, 2023 Thompson Sampling
— Unverified 0A General Recipe for the Analysis of Randomized Multi-Armed Bandit Algorithms Mar 10, 2023 Thompson Sampling
— Unverified 0Thompson Sampling for Linear Bandit Problems with Normal-Gamma Priors Mar 6, 2023 Thompson Sampling
— Unverified 0The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models Feb 28, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0When Combinatorial Thompson Sampling meets Approximation Regret Feb 22, 2023 Thompson Sampling
— Unverified 0