Diffusion Models Meet Contextual Bandits with Large Action Spaces Feb 15, 2024 Efficient Exploration Multi-Armed Bandits
— Unverified 0DISCO: An End-to-End Bandit Framework for Personalised Discount Allocation Jun 10, 2024 Thompson Sampling
— Unverified 0Discounted Thompson Sampling for Non-Stationary Bandit Problems May 18, 2023 Thompson Sampling
— Unverified 0Distilled Thompson Sampling: Practical and Efficient Thompson Sampling via Imitation Learning Nov 29, 2020 Action Generation Decision Making
— Unverified 0Distributed Thompson Sampling Dec 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Adaptive Combinatorial Allocation Nov 4, 2020 Thompson Sampling
— Unverified 0Diversified Sampling for Batched Bayesian Optimization with Determinantal Point Processes Oct 22, 2021 Bayesian Optimization Diversity
— Unverified 0Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits Sep 15, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Double-Linear Thompson Sampling for Context-Attentive Bandits Oct 15, 2020 Medical Diagnosis Thompson Sampling
— Unverified 0AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning Apr 8, 2019 Bayesian Optimization Inductive Bias
— Unverified 0Bag of Policies for Distributional Deep Exploration Aug 3, 2023 Atari Games Efficient Exploration
— Unverified 0Double Thompson Sampling in Finite stochastic Games Feb 21, 2022 Thompson Sampling
— Unverified 0Online Multi-Armed Bandits with Adaptive Inference Feb 25, 2021 Causal Inference Decision Making
— Unverified 0Doubly robust Thompson sampling for linear payoffs Feb 1, 2021 Thompson Sampling
— Unverified 0Doubly Robust Thompson Sampling with Linear Payoffs Dec 1, 2021 Thompson Sampling
— Unverified 0DRL-based Joint Resource Scheduling of eMBB and URLLC in O-RAN Jul 16, 2024 Decision Making Deep Reinforcement Learning
— Unverified 0Dual-Directed Algorithm Design for Efficient Pure Exploration Oct 30, 2023 Thompson Sampling
— Unverified 0Bandit Convex Optimization: sqrtT Regret in One Dimension Feb 23, 2015 Thompson Sampling
— Unverified 0Dynamic collaborative filtering Thompson Sampling for cross-domain advertisements recommendation Aug 25, 2022 Collaborative Filtering Recommendation Systems
— Unverified 0Dynamic Decision-Making under Model Misspecification May 20, 2025 Decision Making model
— Unverified 0Bayesian Quantile and Expectile Optimisation Jan 12, 2020 Bayesian Optimisation Gaussian Processes
— Unverified 0An Information-Theoretic Analysis of Thompson Sampling for Logistic Bandits Dec 3, 2024 Thompson Sampling
— Unverified 0Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization Oct 7, 2020 Thompson Sampling
— Unverified 0Efficient and Adaptive Posterior Sampling Algorithms for Bandits May 2, 2024 Thompson Sampling
— Unverified 0Efficient Benchmarking of NLP APIs using Multi-armed Bandits Apr 1, 2017 Benchmarking Multi-Armed Bandits
— Unverified 0Efficient Exploration for LLMs Feb 1, 2024 Efficient Exploration Thompson Sampling
— Unverified 0Efficient exploration of zero-sum stochastic games Feb 24, 2020 Efficient Exploration Thompson Sampling
— Unverified 0Bandits Under The Influence (Extended Version) Sep 21, 2020 Recommendation Systems Thompson Sampling
— Unverified 0Efficient exploration with Double Uncertain Value Networks Nov 29, 2017 Efficient Exploration Reinforcement Learning
— Unverified 0Efficient Inference Without Trading-off Regret in Bandits: An Allocation Probability Test for Thompson Sampling Oct 30, 2021 Thompson Sampling
— Unverified 0Efficient kernelized bandit algorithms via exploration distributions Jun 11, 2025 Thompson Sampling
— Unverified 0Efficient Learning in Large-Scale Combinatorial Semi-Bandits Jun 28, 2014 Thompson Sampling
— Unverified 0Adaptively Optimize Content Recommendation Using Multi Armed Bandit Algorithms in E-commerce Jul 30, 2021 Thompson Sampling
— Unverified 0Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling Oct 7, 2024 continuous-control Continuous Control
— Unverified 0Efficient Multivariate Bandit Algorithm with Path Planning Sep 6, 2019 Heuristic Search Thompson Sampling
— Unverified 0Efficient Online Learning for Cognitive Radar-Cellular Coexistence via Contextual Thompson Sampling Aug 24, 2020 Deep Reinforcement Learning Thompson Sampling
— Unverified 0Batched Thompson Sampling for Multi-Armed Bandits Aug 15, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Efficient Thompson Sampling for Online Matrix-Factorization Recommendation Dec 1, 2015 Collaborative Filtering Recommendation Systems
— Unverified 0Efficient-UCBV: An Almost Optimal Algorithm using Variance Estimates Nov 9, 2017 Thompson Sampling
— Unverified 0Eluder Dimension and the Sample Complexity of Optimistic Exploration Dec 1, 2013 Thompson Sampling
— Unverified 0ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment Mar 11, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Ensemble Sampling May 20, 2017 Thompson Sampling
— Unverified 0Epinet for Content Cold Start Nov 20, 2024 Recommendation Systems Thompson Sampling
— Unverified 0Epsilon-Greedy Thompson Sampling to Bayesian Optimization Mar 1, 2024 Bayesian Optimization Cantilever Beam
— Unverified 0Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies Nov 16, 2017 Decision Making Thompson Sampling
— Unverified 0Estimating prediction error for complex samples Nov 13, 2017 Prediction Survey
— Unverified 0A Copula approach for hyperparameter transfer learning Sep 25, 2019 Bayesian Optimization Thompson Sampling
— Unverified 0Etat de l'art sur l'application des bandits multi-bras Jan 4, 2021 Thompson Sampling
— Unverified 0EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning Jan 16, 2025 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation May 2, 2024 Bayesian Optimization Conversational Recommendation
— Unverified 0