From Predictions to Decisions: The Importance of Joint Predictive Distributions Jul 20, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems Apr 28, 2015 News Recommendation Thompson Sampling
— Unverified 0Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space Jun 5, 2023 Thompson Sampling
— Unverified 0Expected Improvement-based Contextual Bandits Sep 29, 2021 Bayesian Optimization Multi-Armed Bandits
— Unverified 0Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation May 2, 2024 Bayesian Optimization Conversational Recommendation
— Unverified 0An Information-Theoretic Analysis of Thompson Sampling Mar 21, 2014 Thompson Sampling
— Unverified 0Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits Aug 28, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning Oct 2, 2021 Multi-Armed Bandits regression
— Unverified 0An Information-Theoretic Analysis for Thompson Sampling with Many Actions May 30, 2018 Thompson Sampling
— Unverified 0Adaptively Learning to Select-Rank in Online Platforms Jun 7, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic Experimentation May 24, 2023 Thompson Sampling
— Unverified 0Incentivized Exploration for Multi-Armed Bandits under Reward Drift Nov 12, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 0Online Learning with Cumulative Oversampling: Application to Budgeted Influence Maximization Apr 24, 2020 Thompson Sampling
— Unverified 0Bayesian Optimization-Based Beam Alignment for MmWave MIMO Communication Systems Jul 28, 2022 Bayesian Optimization Thompson Sampling
— Unverified 0A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification Jun 16, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Feel-Good Thompson Sampling for Contextual Dueling Bandits Apr 9, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Bayesian Optimization with Inexact Acquisition: Is Random Grid Search Sufficient? Jun 13, 2025 Bayesian Optimization Thompson Sampling
— Unverified 0Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0First-Order Bayesian Regret Analysis of Thompson Sampling Feb 2, 2019 Combinatorial Optimization Thompson Sampling
— Unverified 0Fixed-Confidence Guarantees for Bayesian Best-Arm Identification Oct 24, 2019 Thompson Sampling
— Unverified 0Fourier Representations for Black-Box Optimization over Categorical Variables Feb 8, 2022 regression Thompson Sampling
— Unverified 0Freshness-Aware Thompson Sampling Sep 29, 2014 Recommendation Systems Thompson Sampling
— Unverified 0From Bandits Model to Deep Deterministic Policy Gradient, Reinforcement Learning with Contextual Information Oct 1, 2023 Decision Making reinforcement-learning
— Unverified 0Fully Distributed Bayesian Optimization with Stochastic Policies Feb 26, 2019 Bayesian Optimization Thompson Sampling
— Unverified 0Gaussian Process Thompson Sampling via Rootfinding Oct 10, 2024 Bayesian Optimization Decision Making
— Unverified 0Generalized Bayesian deep reinforcement learning Dec 16, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Generalized Probabilistic Bisection for Stochastic Root-Finding Nov 2, 2017 Thompson Sampling
— Unverified 0Generalized Regret Analysis of Thompson Sampling using Fractional Posteriors Sep 12, 2023 Thompson Sampling
— Unverified 0Generalized Thompson Sampling for Contextual Bandits Oct 27, 2013 Multi-Armed Bandits Thompson Sampling
— Unverified 0Best Arm Identification in Batched Multi-armed Bandit Problems Dec 21, 2023 Marketing Thompson Sampling
— Unverified 0Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions May 22, 2025 Large Language Model Thompson Sampling
— Unverified 0Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Jun 26, 2023 Decision Making Thompson Sampling
— Unverified 0Graph Neural Thompson Sampling Jun 15, 2024 Decision Making Graph Neural Network
— Unverified 0Feedback graph regret bounds for Thompson Sampling and UCB May 23, 2019 Thompson Sampling
— Unverified 0Greedy Bandits with Sampled Context Jul 27, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Greedy k-Center from Noisy Distance Samples Nov 3, 2020 Thompson Sampling
— Unverified 0GuideBoot: Guided Bootstrap for Deep Contextual Bandits Jul 18, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0GUTS: Generalized Uncertainty-Aware Thompson Sampling for Multi-Agent Active Search Apr 4, 2023 All Disaster Response
— Unverified 0gym-saturation: Gymnasium environments for saturation provers (System description) Sep 16, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Hierarchical Bayesian Bandits Nov 12, 2021 Federated Learning Thompson Sampling
— Unverified 0High-dimensional near-optimal experiment design for drug discovery via Bayesian sparse sampling Apr 23, 2021 Bayesian Inference Drug Discovery
— Unverified 0Horde of Bandits using Gaussian Markov Random Fields Mar 7, 2017 Clustering Multi-Armed Bandits
— Unverified 0Human collective intelligence as distributed Bayesian inference Aug 5, 2016 Bayesian Inference Decision Making
— Unverified 0Hypermodels for Exploration Jun 12, 2020 Thompson Sampling
— Unverified 0IBAC: An Intelligent Dynamic Bandwidth Channel Access Avoiding Outside Warning Range Problem Jan 15, 2022 Thompson Sampling
— Unverified 0Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning Oct 30, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Improved Regret Bounds for Thompson Sampling in Linear Quadratic Control Problems Jul 1, 2018 Reinforcement Learning Thompson Sampling
— Unverified 0Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration Oct 23, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Chained Information-Theoretic bounds and Tight Regret Rate for Linear Bandit Problems Mar 5, 2024 Thompson Sampling
— Unverified 0Fast online inference for nonlinear contextual bandit based on Generative Adversarial Network Feb 17, 2022 Bayesian Inference Generative Adversarial Network
— Unverified 0