Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis Sep 10, 2024 Meta-Learning Multi-Armed Bandits
— Unverified 00 Module-wise Adaptive Distillation for Multimodality Foundation Models Oct 6, 2023 Image Captioning Thompson Sampling
— Unverified 00 Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning Nov 23, 2022 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 00 Monte-Carlo tree search with uncertainty propagation via optimal transport Sep 19, 2023 Thompson Sampling
— Unverified 00 MOTS: Minimax Optimal Thompson Sampling Mar 3, 2020 Thompson Sampling
— Unverified 00 Multi-Agent Active Search using Detection and Location Uncertainty Mar 9, 2022 Decision Making Disaster Response
— Unverified 00 Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or Bayesian? Jun 5, 2021 Thompson Sampling
— Unverified 00 Multi-Armed Bandit Strategies for Non-Stationary Reward Distributions and Delayed Feedback Processes Feb 22, 2019 Thompson Sampling
— Unverified 00 Multi-armed Bandits with Cost Subsidy Nov 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Multi-dueling Bandits with Dependent Arms Apr 29, 2017 Thompson Sampling
— Unverified 00 Multi-Task Combinatorial Bandits for Budget Allocation Aug 31, 2024 Gaussian Processes Marketing
— Unverified 00 Near Optimal Adversarial Attacks on Stochastic Bandits and Defenses with Smoothed Responses Aug 21, 2020 Adversarial Attack Thompson Sampling
— Unverified 00 Neural Contextual Bandits Under Delayed Feedback Constraints Apr 16, 2025 Multi-Armed Bandits Recommendation Systems
— Unverified 00 Neural Dueling Bandits: Preference-Based Optimization with Human Feedback Jul 24, 2024 Thompson Sampling
— Unverified 00 Neural Model-based Optimization with Right-Censored Observations Sep 29, 2020 model regression
— Unverified 00 New Insights into Bootstrapping for Bandits May 24, 2018 Thompson Sampling
— Unverified 00 No Algorithmic Collusion in Two-Player Blindfolded Game with Thompson Sampling May 23, 2024 Thompson Sampling
— Unverified 00 Nonparametric General Reinforcement Learning Nov 28, 2016 General Reinforcement Learning reinforcement-learning
— Unverified 00 Non-Stationary Bandit Learning via Predictive Sampling May 4, 2022 Attribute Thompson Sampling
— Unverified 00 Non-Stationary Dynamic Pricing Via Actor-Critic Information-Directed Pricing Aug 19, 2022 Thompson Sampling
— Unverified 00 Non-Stationary Latent Bandits Dec 1, 2020 Recommendation Systems Thompson Sampling
— Unverified 00 No Regrets for Learning the Prior in Bandits Jul 13, 2021 Thompson Sampling
— Unverified 00 Observation-Free Attacks on Stochastic Bandits Dec 1, 2021 Thompson Sampling
— Unverified 00 On Adaptive Estimation for Dynamic Bernoulli Bandits Dec 8, 2017 Thompson Sampling
— Unverified 00 On Batch Bayesian Optimization Nov 4, 2019 Bayesian Optimization Thompson Sampling
— Unverified 00 On Dynamic Pricing with Covariates Dec 25, 2021 Thompson Sampling
— Unverified 00 On Efficiency in Hierarchical Reinforcement Learning Dec 1, 2020 Computational Efficiency Decision Making
— Unverified 00 On Improved Regret Bounds In Bayesian Optimization with Gaussian Noise Dec 25, 2024 Bayesian Optimization Thompson Sampling
— Unverified 00 On Kernelized Multi-Armed Bandits with Constraints Mar 29, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 00 On learning Whittle index policy for restless bandits with scalable regret Feb 7, 2022 Scheduling Thompson Sampling
— Unverified 00 Online Algorithms For Parameter Mean And Variance Estimation In Dynamic Regression Models May 18, 2016 parameter estimation regression
— Unverified 00 Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits Feb 18, 2023 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 00 Online Causal Inference for Advertising in Real-Time Bidding Auctions Aug 22, 2019 Causal Inference Experimental Design
— Unverified 00 Online Learning and Distributed Control for Residential Demand Response Oct 11, 2020 Stochastic Optimization Thompson Sampling
— Unverified 00 Online Learning-based Waveform Selection for Improved Vehicle Recognition in Automotive Radar Dec 1, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 00 Online Learning of Energy Consumption for Navigation of Electric Vehicles Nov 3, 2021 Navigate Thompson Sampling
— Unverified 00 Online Learning of Network Bottlenecks via Minimax Paths Sep 17, 2021 Thompson Sampling
— Unverified 00 Online Residential Demand Response via Contextual Multi-Armed Bandits Mar 7, 2020 Decision Making Multi-Armed Bandits
— Unverified 00 Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling Mar 16, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 00 On Multi-Armed Bandit Designs for Dose-Finding Clinical Trials Mar 17, 2019 Thompson Sampling
— Unverified 00 On Online Learning in Kernelized Markov Decision Processes Nov 4, 2019 Thompson Sampling
— Unverified 00 On The Differential Privacy of Thompson Sampling With Gaussian Prior Jun 24, 2018 Thompson Sampling
— Unverified 00 On the Importance of Uncertainty in Decision-Making with Large Language Models Apr 3, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 On the Performance of Thompson Sampling on Logistic Bandits May 12, 2019 Thompson Sampling
— Unverified 00 On the Prior Sensitivity of Thompson Sampling Jun 10, 2015 Sensitivity Thompson Sampling
— Unverified 00 On Thompson Sampling for Smoother-than-Lipschitz Bandits Jan 8, 2020 reinforcement-learning Reinforcement Learning
— Unverified 00 On Thompson Sampling with Langevin Algorithms Feb 23, 2020 Thompson Sampling
— Unverified 00 On Frequentist Regret of Linear Thompson Sampling Jun 11, 2020 Thompson Sampling
— Unverified 00 Near-Optimal Algorithms for Differentially Private Online Learning in a Stochastic Environment Feb 16, 2021 Thompson Sampling
— Unverified 00 Optimal Exploration is no harder than Thompson Sampling Oct 9, 2023 Thompson Sampling
— Unverified 00