Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits Feb 3, 2023 Thompson Sampling
— Unverified 00 Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks Nov 27, 2018 Thompson Sampling
— Unverified 00 Optimal No-regret Learning in Repeated First-price Auctions Mar 22, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs Mar 30, 2016 Recommendation Systems Reinforcement Learning
— Unverified 00 Optimistic posterior sampling for reinforcement learning: worst-case regret bounds Dec 1, 2017 reinforcement-learning Reinforcement Learning
— Unverified 00 Optimistic Thompson Sampling for No-Regret Learning in Unknown Games Feb 7, 2024 Decision Making Thompson Sampling
— Unverified 00 Optimization of a SSP's Header Bidding Strategy using Thompson Sampling Jul 9, 2018 Thompson Sampling
— Unverified 00 Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identification Feb 16, 2024 Thompson Sampling
— Unverified 00 Ordinal Bayesian Optimisation Dec 5, 2019 Bayesian Optimisation Thompson Sampling
— Unverified 00 Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space Jun 6, 2017 Bayesian Optimization Thompson Sampling
— Unverified 00 Parallel Bayesian Optimization Using Satisficing Thompson Sampling for Time-Sensitive Black-Box Optimization Oct 19, 2023 Bayesian Optimization STS
— Unverified 00 Parallel Contextual Bandits in Wireless Handover Optimization Jan 21, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Parallelizing Thompson Sampling Jun 2, 2021 Decision Making Thompson Sampling
— Unverified 00 Partial Likelihood Thompson Sampling Mar 2, 2022 Thompson Sampling
— Unverified 00 Partially Observable Contextual Bandits with Linear Payoffs Sep 17, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 Partially Observable Online Change Detection via Smooth-Sparse Decomposition Sep 22, 2020 Bayesian Inference Change Detection
— Unverified 00 PG-TS: Improved Thompson Sampling for Logistic Contextual Bandits May 18, 2018 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem Oct 30, 2024 Scheduling Thompson Sampling
— Unverified 00 Policy Gradient Optimization of Thompson Sampling Policies Jun 30, 2020 Policy Gradient Methods Thompson Sampling
— Unverified 00 Position-Based Multiple-Play Bandits with Thompson Sampling Sep 28, 2020 Position Recommendation Systems
— Unverified 00 Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds Nov 7, 2023 Bayesian Optimization Thompson Sampling
— Unverified 00 Posterior sampling for reinforcement learning: worst-case regret bounds May 19, 2017 reinforcement-learning Reinforcement Learning
— Unverified 00 Posterior Sampling via Autoregressive Generation May 29, 2024 Articles Decision Making
— Unverified 00 Practical Adversarial Attacks on Stochastic Bandits via Fake Data Injection May 28, 2025 Thompson Sampling
— Unverified 00 Preferential Multi-Objective Bayesian Optimization Jun 20, 2024 Autonomous Driving Bayesian Optimization
— Unverified 00 Prior-free and prior-dependent regret bounds for Thompson Sampling Apr 21, 2013 Thompson Sampling
— Unverified 00 Probabilistic Inference in Reinforcement Learning Done Right Nov 22, 2023 reinforcement-learning Reinforcement Learning
— Unverified 00 Profitable Bandits May 8, 2018 Management Thompson Sampling
— Unverified 00 QoS-Aware Multi-Armed Bandits Feb 28, 2017 Decision Making Multi-Armed Bandits
— Unverified 00 Racing Thompson: an Efficient Algorithm for Thompson Sampling with Non-conjugate Priors Aug 16, 2017 Thompson Sampling
— Unverified 00 Random Effect Bandits Jun 23, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Random Hypervolume Scalarizations for Provable Multi-Objective Black Box Optimization Jun 8, 2020 Bayesian Optimization Thompson Sampling
— Unverified 00 Randomised Bayesian Least-Squares Policy Iteration Apr 6, 2019 Thompson Sampling
— Unverified 00 Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning Apr 16, 2024 Federated Learning Multi-agent Reinforcement Learning
— Unverified 00 Regenerative Particle Thompson Sampling Mar 15, 2022 Thompson Sampling
— Unverified 00 Regret Analysis of Bandit Problems with Causal Background Knowledge Oct 11, 2019 Thompson Sampling
— Unverified 00 Regret Analysis of the Finite-Horizon Gittins Index Strategy for Multi-Armed Bandits Nov 18, 2015 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Regret Bounds for Information-Directed Reinforcement Learning Jun 9, 2022 reinforcement-learning Reinforcement Learning
— Unverified 00 Regularized-OFU: an efficient algorithm for general contextual bandit with optimization oracles Sep 29, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Reinforcement Learning for Efficient and Tuning-Free Link Adaptation Oct 16, 2020 reinforcement-learning Reinforcement Learning
— Unverified 00 Reinforcement learning techniques for Outer Loop Link Adaptation in 4G/5G systems Aug 3, 2017 Multi-Armed Bandits reinforcement-learning
— Unverified 00 Reinforcement Learning with Subspaces using Free Energy Paradigm Dec 13, 2020 reinforcement-learning Reinforcement Learning
— Unverified 00 Reinforcement Learning with Trajectory Feedback Aug 13, 2020 reinforcement-learning Reinforcement Learning
— Unverified 00 Remote Contextual Bandits Feb 10, 2022 Marketing Multi-Armed Bandits
— Unverified 00 Residual Bootstrap Exploration for Bandit Algorithms Feb 19, 2020 Computational Efficiency Multi-Armed Bandits
— Unverified 00 Revised Progressive-Hedging-Algorithm Based Two-layer Solution Scheme for Bayesian Reinforcement Learning Jun 21, 2019 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 00 Reward Biased Maximum Likelihood Estimation for Reinforcement Learning Nov 16, 2020 Multi-Armed Bandits reinforcement-learning
— Unverified 00 Risk and optimal policies in bandit experiments Dec 13, 2021 Dimensionality Reduction Thompson Sampling
— Unverified 00 Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs Jun 24, 2022 Thompson Sampling
— Unverified 00 Risk-Constrained Thompson Sampling for CVaR Bandits Nov 16, 2020 Decision Making Thompson Sampling
— Unverified 00