An Adversarial Analysis of Thompson Sampling for Full-information Online Learning: from Finite to Infinite Action Spaces Feb 20, 2025 Bayesian Optimization Thompson Sampling
— Unverified 0Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring Jun 17, 2020 Decision Making Thompson Sampling
— Unverified 0Analysis of Thompson Sampling for Combinatorial Multi-armed Bandit with Probabilistically Triggered Arms Sep 7, 2018 Thompson Sampling
— Unverified 0Adaptive Rate of Convergence of Thompson Sampling for Gaussian Process Optimization May 18, 2017 global-optimization Thompson Sampling
— Unverified 0Analysis of Thompson Sampling for Graphical Bandits Without the Graphs May 23, 2018 Thompson Sampling
— Unverified 0Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits Oct 23, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Analyzing and Enhancing Queue Sampling for Energy-Efficient Remote Control of Bandits May 15, 2024 Autonomous Vehicles Thompson Sampling
— Unverified 0An Analysis of Ensemble Sampling Mar 2, 2022 Thompson Sampling
— Unverified 0An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits Sep 5, 2019 Decision Making Recommendation Systems
— Unverified 0An Efficient Algorithm For Generalized Linear Bandit: Online Stochastic Gradient Descent and Thompson Sampling Jun 7, 2020 Thompson Sampling
— Unverified 0A Formal Solution to the Grain of Truth Problem Sep 16, 2016 Thompson Sampling
— Unverified 0An Empirical Evaluation of Thompson Sampling Dec 1, 2011 Multi-Armed Bandits Thompson Sampling
— Unverified 0AdaptEx: A Self-Service Contextual Bandit Platform Aug 8, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0BanditCAT and AutoIRT: Machine Learning Approaches to Computerized Adaptive Testing and Item Calibration Oct 28, 2024 AutoML Thompson Sampling
— Unverified 0A Federated Online Restless Bandit Framework for Cooperative Resource Allocation Jun 12, 2024 Federated Learning Multi-Armed Bandits
— Unverified 0Adjusted Expected Improvement for Cumulative Regret Minimization in Noisy Bayesian Optimization May 10, 2022 Bayesian Optimization Thompson Sampling
— Unverified 0Active Search for High Recall: a Non-Stationary Extension of Thompson Sampling Dec 27, 2017 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Distributed Neural Linear Thompson Sampling Framework to Achieve URLLC in Industrial IoT Nov 21, 2023 Scheduling Thompson Sampling
— Unverified 0Active Reinforcement Learning with Monte-Carlo Tree Search Mar 13, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0A Bandit Approach to Online Pricing for Heterogeneous Edge Resource Allocation Feb 14, 2023 Edge-computing Thompson Sampling
— Unverified 0AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning Apr 8, 2019 Bayesian Optimization Inductive Bias
— Unverified 0Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control Sep 24, 2020 Change Point Detection Computational Efficiency
— Unverified 0Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation Feb 18, 2022 Thompson Sampling
— Unverified 0Approximate Thompson Sampling for Learning Linear Quadratic Regulators with O(T) Regret May 29, 2024 Thompson Sampling
— Unverified 0Approximate information for efficient exploration-exploitation strategies Jul 4, 2023 Decision Making Efficient Exploration
— Unverified 0Fast Change Identification in Multi-Play Bandits and its Applications in Wireless Networks May 20, 2022 Change Detection Edge-computing
— Unverified 0A Bayesian Choice Model for Eliminating Feedback Loops Aug 15, 2019 Recommendation Systems Thompson Sampling
— Unverified 0A Practical Method for Solving Contextual Bandit Problems Using Decision Trees Jun 14, 2017 Thompson Sampling
— Unverified 0A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning Aug 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Efficiently Tackling Million-Dimensional Multiobjective Problems: A Direction Sampling and Fine-Tuning Approach Apr 8, 2023 Multiobjective Optimization Recommendation Systems
— Unverified 0A Reinforcement Learning based Reset Policy for CDCL SAT Solvers Apr 4, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems Aug 19, 2021 Thompson Sampling
— Unverified 0A Reliability-aware Multi-armed Bandit Approach to Learn and Select Users in Demand Response Mar 20, 2020 Avg Thompson Sampling
— Unverified 0A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food Mar 15, 2024 Scheduling Thompson Sampling
— Unverified 0A sequential Monte Carlo approach to Thompson sampling for Bayesian optimization Apr 1, 2016 Bayesian Optimization Thompson Sampling
— Unverified 0A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0A study of Thompson Sampling with Parameter h Oct 5, 2017 Thompson Sampling
— Unverified 0Asymptotically Optimal Algorithms for Budgeted Multiple Play Bandits Jun 30, 2016 Thompson Sampling
— Unverified 0Asymptotically Optimal Bandits under Weighted Information May 28, 2021 Thompson Sampling
— Unverified 0Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget Jun 3, 2025 Thompson Sampling
— Unverified 0The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models Feb 28, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Asymptotic Convergence of Thompson Sampling Nov 8, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Aging Bandits: Regret Analysis and Order-Optimal Learning Algorithm for Wireless Networks with Stochastic Arrivals Dec 16, 2020 Thompson Sampling
— Unverified 0Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification Sep 29, 2021 Binary Classification Thompson Sampling
— Unverified 0Asynchronous Multi Agent Active Search Jun 25, 2020 Bayesian Optimization Compressive Sensing
— Unverified 0Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization Dec 15, 2021 Thompson Sampling
— Unverified 0An Unbiased Data Collection and Content Exploitation/Exploration Strategy for Personalization Apr 12, 2016 Recommendation Systems Thompson Sampling
— Unverified 0Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems Jan 25, 2022 parameter estimation Thompson Sampling
— Unverified 0Adaptive Sensor Placement for Continuous Spaces May 16, 2019 Thompson Sampling
— Unverified 0