KLUCB Approach to Copeland Bandits Feb 7, 2019 Information Retrieval Reinforcement Learning
Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits May 30, 2021 Edge-computing Portfolio Optimization
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning Jun 15, 2023 Decision Making Multi-Armed Bandits
Latent Bandits Revisited Jun 15, 2020 Recommendation Systems Thompson Sampling
Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect Jun 18, 2020 Decision Making Multi-Armed Bandits
Sample Efficient Learning of Factored Embeddings of Tensor Fields Sep 1, 2022 Recommendation Systems Thompson Sampling
Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration Feb 8, 2023 Bayesian Inference Thompson Sampling
Learning to Optimize Via Posterior Sampling Jan 11, 2013 Thompson Sampling
Learning to Price with Reference Effects Aug 29, 2017 Reinforcement Learning Thompson Sampling
Learning to Rank in the Position Based Model with Bandit Feedback Apr 27, 2020 Learning-To-Rank Multi-Armed Bandits
Learning Unknown Markov Decision Processes: A Thompson Sampling Approach Sep 14, 2017 Reinforcement Learning Thompson Sampling
Lenient Regret for Multi-Armed Bandits Aug 10, 2020 Multi-Armed Bandits Thompson Sampling
Leveraging Demonstrations to Improve Online Learning: Quality Matters Feb 7, 2023 Thompson Sampling
Leveraging Offline Data from Similar Systems for Online Linear Quadratic Control May 14, 2025 Thompson Sampling
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits May 27, 2022 Multi-Armed Bandits Thompson Sampling
Linear Bandit algorithms using the Bootstrap May 4, 2016 Thompson Sampling
Linear Thompson Sampling Revisited Nov 20, 2016 Thompson Sampling
Little Exploration is All You Need Oct 26, 2023 All Thompson Sampling
Maillard Sampling: Boltzmann Exploration Done Optimally Nov 5, 2021 counterfactual Thompson Sampling
Making RL with Preference-based Feedback Efficient via Randomization Oct 23, 2023 Active Learning Thompson Sampling
Making Sense of Reinforcement Learning and Probabilistic Inference Jan 3, 2020 reinforcement-learning Reinforcement Learning
Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow Jul 1, 2021 Decision Making Marketing
Optimization-Driven Adaptive Experimentation Aug 8, 2024 GPU Thompson Sampling
Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents Jun 18, 2024 continuous-control Continuous Control
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models Aug 13, 2021 Multi-Armed Bandits Thompson Sampling
Meta Dynamic Pricing: Transfer Learning Across Experiments Feb 28, 2019 Thompson Sampling Transfer Learning
Meta Learning in Bandits within Shared Affine Subspaces Mar 31, 2024 Meta-Learning Thompson Sampling
Metalearning Linear Bandits by Prior Update Jul 12, 2021 Decision Making Sequential Decision Making
Meta Learning of Interface Conditions for Multi-Domain Physics-Informed Neural Networks Oct 23, 2022 Meta-Learning Thompson Sampling
Meta-Reinforcement Learning With Informed Policy Regularization Jan 1, 2021 Meta Reinforcement Learning reinforcement-learning
Meta-Thompson Sampling Feb 11, 2021 Efficient Exploration Meta-Learning
Minimal Exploration in Structured Stochastic Bandits Nov 1, 2017 Thompson Sampling
TS-RSR: A provably efficient approach for batch Bayesian Optimization Mar 7, 2024 Bayesian Optimization Thompson Sampling
Mixed-Variable Bayesian Optimization Jul 2, 2019 Bayesian Optimization Thompson Sampling
Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models Feb 16, 2021 Decision Making Meta Reinforcement Learning
Model-Free Approximate Bayesian Learning for Large-Scale Conversion Funnel Optimization Jan 12, 2024 Decision Making Marketing
Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis Sep 10, 2024 Meta-Learning Multi-Armed Bandits
Module-wise Adaptive Distillation for Multimodality Foundation Models Oct 6, 2023 Image Captioning Thompson Sampling
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning Nov 23, 2022 Multi-Objective Reinforcement Learning reinforcement-learning
Monte-Carlo tree search with uncertainty propagation via optimal transport Sep 19, 2023 Thompson Sampling
MOTS: Minimax Optimal Thompson Sampling Mar 3, 2020 Thompson Sampling
Multi-Agent Active Search using Detection and Location Uncertainty Mar 9, 2022 Decision Making Disaster Response
Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or Bayesian? Jun 5, 2021 Thompson Sampling
Multi-Armed Bandit Strategies for Non-Stationary Reward Distributions and Delayed Feedback Processes Feb 22, 2019 Thompson Sampling
Multi-dueling Bandits with Dependent Arms Apr 29, 2017 Thompson Sampling
Multi-Task Combinatorial Bandits for Budget Allocation Aug 31, 2024 Gaussian Processes Marketing
Near Optimal Adversarial Attacks on Stochastic Bandits and Defenses with Smoothed Responses Aug 21, 2020 Adversarial Attack Thompson Sampling
Neural Contextual Bandits Under Delayed Feedback Constraints Apr 16, 2025 Multi-Armed Bandits Recommendation Systems
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback Jul 24, 2024 Thompson Sampling
Neural Model-based Optimization with Right-Censored Observations Sep 29, 2020 model regression