From Predictions to Decisions: The Importance of Joint Predictive Distributions Jul 20, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems Apr 28, 2015 News Recommendation Thompson Sampling
— Unverified 00 Convergence Rates of Posterior Distributions in Markov Decision Process Jul 22, 2019 Thompson Sampling
— Unverified 00 Expected Improvement-based Contextual Bandits Sep 29, 2021 Bayesian Optimization Multi-Armed Bandits
— Unverified 00 A study of Thompson Sampling with Parameter h Oct 5, 2017 Thompson Sampling
— Unverified 00 A Formal Solution to the Grain of Truth Problem Sep 16, 2016 Thompson Sampling
— Unverified 00 AdaptEx: A Self-Service Contextual Bandit Platform Aug 8, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Contextual Thompson Sampling via Generation of Missing Data Feb 10, 2025 Decision Making Fairness
— Unverified 00 Contextual Multi-Armed Bandits for Causal Marketing Oct 2, 2018 Causal Inference counterfactual
— Unverified 00 A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model Jan 31, 2019 Recommendation Systems Thompson Sampling
— Unverified 00 Contextual Bandit with Herding Effects: Algorithms and Recommendation Applications Aug 26, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 00 A sequential Monte Carlo approach to Thompson sampling for Bayesian optimization Apr 1, 2016 Bayesian Optimization Thompson Sampling
— Unverified 00 A Federated Online Restless Bandit Framework for Cooperative Resource Allocation Jun 12, 2024 Federated Learning Multi-Armed Bandits
— Unverified 00 Contextual Bandits with Non-Stationary Correlated Rewards for User Association in MmWave Vehicular Networks Oct 8, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Contextual Bandits for Advertising Budget Allocation Aug 22, 2020 Marketing Multi-Armed Bandits
— Unverified 00 A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food Mar 15, 2024 Scheduling Thompson Sampling
— Unverified 00 Adaptive Portfolio by Solving Multi-armed Bandit via Thompson Sampling Nov 13, 2019 Decision Making Management
— Unverified 00 Context Attribution with Multi-Armed Bandit Optimization Jun 24, 2025 Thompson Sampling
— Unverified 00 A Reliability-aware Multi-armed Bandit Approach to Learn and Select Users in Demand Response Mar 20, 2020 Avg Thompson Sampling
— Unverified 00 Adjusted Expected Improvement for Cumulative Regret Minimization in Noisy Bayesian Optimization May 10, 2022 Bayesian Optimization Thompson Sampling
— Unverified 00 Active Search for High Recall: a Non-Stationary Extension of Thompson Sampling Dec 27, 2017 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Context Attentive Bandits: Contextual Bandit with Restricted Context May 10, 2017 Recommendation Systems Thompson Sampling
— Unverified 00 A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems Aug 19, 2021 Thompson Sampling
— Unverified 00 Constrained Thompson Sampling for Wireless Link Optimization Feb 28, 2019 Thompson Sampling
— Unverified 00 A Reinforcement Learning based Reset Policy for CDCL SAT Solvers Apr 4, 2024 reinforcement-learning Reinforcement Learning
— Unverified 00 Constrained Thompson Sampling for Real-Time Electricity Pricing with Grid Reliability Constraints Jun 17, 2020 Thompson Sampling
— Unverified 00 Constrained Contextual Bandit Learning for Adaptive Radar Waveform Selection Mar 9, 2021 Thompson Sampling
— Unverified 00 Efficiently Tackling Million-Dimensional Multiobjective Problems: A Direction Sampling and Fine-Tuning Approach Apr 8, 2023 Multiobjective Optimization Recommendation Systems
— Unverified 00 Connections Between Mirror Descent, Thompson Sampling and the Information Ratio May 28, 2019 Thompson Sampling
— Unverified 00 Connecting Thompson Sampling and UCB: Towards More Efficient Trade-offs Between Privacy and Regret May 5, 2025 Thompson Sampling
— Unverified 00 A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning Aug 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 00 A Distributed Neural Linear Thompson Sampling Framework to Achieve URLLC in Industrial IoT Nov 21, 2023 Scheduling Thompson Sampling
— Unverified 00 Active Reinforcement Learning with Monte-Carlo Tree Search Mar 13, 2018 reinforcement-learning Reinforcement Learning
— Unverified 00 Accelerating Grasp Exploration by Leveraging Learned Priors Nov 11, 2020 Object Thompson Sampling
— Unverified 00 Concurrent Decentralized Channel Allocation and Access Point Selection using Multi-Armed Bandits in multi BSS WLANs Jun 5, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Combining Bayesian Optimization and Lipschitz Optimization Oct 10, 2018 Bayesian Optimization global-optimization
— Unverified 00 A Practical Method for Solving Contextual Bandit Problems Using Decision Trees Jun 14, 2017 Thompson Sampling
— Unverified 00 Combinatorial Neural Bandits May 31, 2023 Thompson Sampling
— Unverified 00 Combinatorial Multi-armed Bandit with Probabilistically Triggered Arms: A Case with Bounded Regret Jul 24, 2017 Movie Recommendation Thompson Sampling
— Unverified 00 Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation Feb 18, 2022 Thompson Sampling
— Unverified 00 Combinatorial Multi-armed Bandits: Arm Selection via Group Testing Oct 14, 2024 Multi-Armed Bandits parameter estimation
— Unverified 00 Bayesian Analysis of Combinatorial Gaussian Process Bandits Dec 20, 2023 Bayesian Inference Informativeness
— Unverified 00 Approximate Thompson Sampling for Learning Linear Quadratic Regulators with O(T) Regret May 29, 2024 Thompson Sampling
— Unverified 00 Code Repair with LLMs gives an Exploration-Exploitation Tradeoff May 26, 2024 Code Repair Language Modeling
— Unverified 00 Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space Exploration Tool for FPGA High-Level Synthesis Jul 3, 2022 Active Learning Descriptive
— Unverified 00 Approximate information for efficient exploration-exploitation strategies Jul 4, 2023 Decision Making Efficient Exploration
— Unverified 00 Fast Change Identification in Multi-Play Bandits and its Applications in Wireless Networks May 20, 2022 Change Detection Edge-computing
— Unverified 00 Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments Mar 22, 2021 Thompson Sampling
— Unverified 00 Chained Information-Theoretic bounds and Tight Regret Rate for Linear Bandit Problems Mar 5, 2024 Thompson Sampling
— Unverified 00