Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning Oct 2, 2021 Multi-Armed Bandits regression
— Unverified 0Feel-Good Thompson Sampling for Contextual Dueling Bandits Apr 9, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0First-Order Bayesian Regret Analysis of Thompson Sampling Feb 2, 2019 Combinatorial Optimization Thompson Sampling
— Unverified 0Fixed-Confidence Guarantees for Bayesian Best-Arm Identification Oct 24, 2019 Thompson Sampling
— Unverified 0Fourier Representations for Black-Box Optimization over Categorical Variables Feb 8, 2022 regression Thompson Sampling
— Unverified 0Freshness-Aware Thompson Sampling Sep 29, 2014 Recommendation Systems Thompson Sampling
— Unverified 0From Bandits Model to Deep Deterministic Policy Gradient, Reinforcement Learning with Contextual Information Oct 1, 2023 Decision Making reinforcement-learning
— Unverified 0Fully Distributed Bayesian Optimization with Stochastic Policies Feb 26, 2019 Bayesian Optimization Thompson Sampling
— Unverified 0Gaussian Process Thompson Sampling via Rootfinding Oct 10, 2024 Bayesian Optimization Decision Making
— Unverified 0Generalized Bayesian deep reinforcement learning Dec 16, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Generalized Probabilistic Bisection for Stochastic Root-Finding Nov 2, 2017 Thompson Sampling
— Unverified 0Generalized Regret Analysis of Thompson Sampling using Fractional Posteriors Sep 12, 2023 Thompson Sampling
— Unverified 0Generalized Thompson Sampling for Contextual Bandits Oct 27, 2013 Multi-Armed Bandits Thompson Sampling
— Unverified 0Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions May 22, 2025 Large Language Model Thompson Sampling
— Unverified 0Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Jun 26, 2023 Decision Making Thompson Sampling
— Unverified 0Graph Neural Thompson Sampling Jun 15, 2024 Decision Making Graph Neural Network
— Unverified 0Feedback graph regret bounds for Thompson Sampling and UCB May 23, 2019 Thompson Sampling
— Unverified 0Greedy Bandits with Sampled Context Jul 27, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Greedy k-Center from Noisy Distance Samples Nov 3, 2020 Thompson Sampling
— Unverified 0GuideBoot: Guided Bootstrap for Deep Contextual Bandits Jul 18, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0GUTS: Generalized Uncertainty-Aware Thompson Sampling for Multi-Agent Active Search Apr 4, 2023 All Disaster Response
— Unverified 0gym-saturation: Gymnasium environments for saturation provers (System description) Sep 16, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Hierarchical Bayesian Bandits Nov 12, 2021 Federated Learning Thompson Sampling
— Unverified 0High-dimensional near-optimal experiment design for drug discovery via Bayesian sparse sampling Apr 23, 2021 Bayesian Inference Drug Discovery
— Unverified 0Horde of Bandits using Gaussian Markov Random Fields Mar 7, 2017 Clustering Multi-Armed Bandits
— Unverified 0Human collective intelligence as distributed Bayesian inference Aug 5, 2016 Bayesian Inference Decision Making
— Unverified 0Hypermodels for Exploration Jun 12, 2020 Thompson Sampling
— Unverified 0IBAC: An Intelligent Dynamic Bandwidth Channel Access Avoiding Outside Warning Range Problem Jan 15, 2022 Thompson Sampling
— Unverified 0Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning Oct 30, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Improved Regret Bounds for Thompson Sampling in Linear Quadratic Control Problems Jul 1, 2018 Reinforcement Learning Thompson Sampling
— Unverified 0Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration Oct 23, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions Jun 16, 2024 Multi-Armed Bandits Policy Gradient Methods
— Unverified 0Improving sample efficiency of high dimensional Bayesian optimization with MCMC Jan 5, 2024 Bayesian Optimization Thompson Sampling
— Unverified 0Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits Aug 28, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Incentivized Exploration for Multi-Armed Bandits under Reward Drift Nov 12, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 0Incentivizing Combinatorial Bandit Exploration Jun 1, 2022 Thompson Sampling
— Unverified 0Incentivizing Exploration with Linear Contexts and Combinatorial Actions Jun 3, 2023 Thompson Sampling
— Unverified 0Incorporating Behavioral Constraints in Online AI Systems Sep 15, 2018 Thompson Sampling
— Unverified 0Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits Aug 10, 2022 Management Multi-Armed Bandits
— Unverified 0Indexed Minimum Empirical Divergence-Based Algorithms for Linear Bandits May 24, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0In-Domain African Languages Translation Using LLMs and Multi-armed Bandits May 21, 2025 Domain Adaptation Machine Translation
— Unverified 0Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems Jul 9, 2020 Thompson Sampling
— Unverified 0Influencing Bandits: Arm Selection for Preference Shaping Feb 29, 2024 Recommendation Systems Thompson Sampling
— Unverified 0Information Directed Sampling and Bandits with Heteroscedastic Noise Jan 29, 2018 Bayesian Optimization Thompson Sampling
— Unverified 0Information Directed Sampling for Stochastic Bandits with Graph Feedback Nov 8, 2017 Decision Making Thompson Sampling
— Unverified 0Information-Theoretic Confidence Bounds for Reinforcement Learning Nov 21, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0IntelligentPooling: Practical Thompson Sampling for mHealth Jul 31, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Joint User Association and Pairing in Multi-UAV-Assisted NOMA Networks: A Decaying-Epsilon Thompson Sampling Framework Jun 20, 2024 Thompson Sampling
— Unverified 0KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems Feb 11, 2025 Thompson Sampling
— Unverified 0