An Analysis of Ensemble Sampling | Mar 2, 2022 | Thompson Sampling
Batch Bayesian Optimization for Replicable Experimental Design | Nov 2, 2023 | AutoML, Bayesian Optimization
Analyzing and Enhancing Queue Sampling for Energy-Efficient Remote Control of Bandits | May 15, 2024 | Autonomous Vehicles, Thompson Sampling
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization | Jun 5, 2022 | BIG-bench Machine Learning, Evolutionary Algorithms
Bandits Under The Influence (Extended Version) | Sep 21, 2020 | Recommendation Systems, Thompson Sampling
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision Making, Multi-Armed Bandits
Bandit Policies for Reliable Cellular Network Handovers in Extreme Mobility | Oct 28, 2020 | Thompson Sampling
Bandit Models of Human Behavior: Reward Processing in Mental Disorders | Jun 7, 2017 | Decision Making, Thompson Sampling
Analysis of Thompson Sampling for Graphical Bandits Without the Graphs | May 23, 2018 | Thompson Sampling
Adaptive Exploration-Exploitation Tradeoff for Opportunistic Bandits | Sep 12, 2017 | Thompson Sampling
A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms | Jun 3, 2021 | Thompson Sampling
Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits | Feb 7, 2024 | Multi-Armed Bandits, Reinforcement Learning (RL)
Bandit Learning for Diversified Interactive Recommendation | Jul 1, 2019 | Bayesian Inference, Diversity
Adaptive Rate of Convergence of Thompson Sampling for Gaussian Process Optimization | May 18, 2017 | global-optimization, Thompson Sampling
Bandit Convex Optimization: √T Regret in One Dimension | Feb 23, 2015 | Thompson Sampling
Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control | Sep 24, 2020 | Change Point Detection, Computational Efficiency
Analysis of Thompson Sampling for Combinatorial Multi-armed Bandit with Probabilistically Triggered Arms | Sep 7, 2018 | Thompson Sampling
Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches | Mar 21, 2023 | Benchmarking, Thompson Sampling
BanditCAT and AutoIRT: Machine Learning Approaches to Computerized Adaptive Testing and Item Calibration | Oct 28, 2024 | AutoML, Thompson Sampling
Bag of Policies for Distributional Deep Exploration | Aug 3, 2023 | Atari Games, Efficient Exploration
Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring | Jun 17, 2020 | Decision Making, Thompson Sampling
AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning | Apr 8, 2019 | Bayesian Optimization, Inductive Bias
Automatic Ensemble Learning for Online Influence Maximization | Nov 25, 2019 | Ensemble Learning, Multi-Armed Bandits
An Adversarial Analysis of Thompson Sampling for Full-information Online Learning: from Finite to Infinite Action Spaces | Feb 20, 2025 | Bayesian Optimization, Thompson Sampling
Adaptive Data Augmentation for Thompson Sampling | Jun 17, 2025 | Data Augmentation, Multi-Armed Bandits
Achieving adaptivity and optimality for multi-armed bandits using Exponential-Kullback Leibler Maillard Sampling | Feb 20, 2025 | Multi-Armed Bandits, Thompson Sampling
A Multi-Armed Bandit to Smartly Select a Training Set from Big Medical Data | May 23, 2017 | Thompson Sampling
A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning | Mar 10, 2023 | Thompson Sampling
Diffusion Approximations for Thompson Sampling | May 19, 2021 | Multi-Armed Bandits, Thompson Sampling
Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems | Jan 25, 2022 | parameter estimation, Thompson Sampling
Aligning AI Agents via Information-Directed Sampling | Oct 18, 2024 | Thompson Sampling
Differentially Private Federated Bayesian Optimization with Distributed Exploration | Oct 27, 2021 | Bayesian Optimization, Federated Learning
Delay-Adaptive Learning in Generalized Linear Contextual Bandits | Mar 11, 2020 | Multi-Armed Bandits, Thompson Sampling
Deep Hierarchy in Bandits | Feb 3, 2022 | Thompson Sampling
Deep Contextual Multi-armed Bandits | Jul 25, 2018 | Marketing, Multi-Armed Bandits
Asynchronous Multi Agent Active Search | Jun 25, 2020 | Bayesian Optimization, Compressive Sensing
Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization | Dec 15, 2021 | Thompson Sampling
Adaptive Combinatorial Allocation | Nov 4, 2020 | Thompson Sampling
A Change-Detection Based Thompson Sampling Framework for Non-Stationary Bandits | Sep 6, 2020 | Change Detection, Thompson Sampling
A Batched Multi-Armed Bandit Approach to News Headline Testing | Aug 17, 2019 | Articles, Thompson Sampling
Deep Active Ensemble Sampling For Image Classification | Oct 11, 2022 | Active Learning, Classification
Deconfounded Warm-Start Thompson Sampling with Applications to Precision Medicine | May 22, 2025 | Thompson Sampling
Deciding What to Learn: A Rate-Distortion Approach | Jan 15, 2021 | Decision Making, Sequential Decision Making
Deep Exploration for Recommendation Systems | Sep 26, 2021 | Recommendation Systems, Thompson Sampling
Decentralized Multi-Agent Active Search and Tracking when Targets Outnumber Agents | Jan 6, 2024 | Decision Making, Thompson Sampling
Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits | Oct 1, 2021 | Multi-Armed Bandits, Thompson Sampling
Aging Bandits: Regret Analysis and Order-Optimal Learning Algorithm for Wireless Networks with Stochastic Arrivals | Dec 16, 2020 | Thompson Sampling
Debiasing Samples from Online Learning Using Bootstrap | Jul 31, 2021 | Off-policy evaluation, Thompson Sampling
Asymptotic Convergence of Thompson Sampling | Nov 8, 2020 | Multi-Armed Bandits, Thompson Sampling
Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models | Nov 22, 2017 | Multi-Armed Bandits, Response Generation