The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models Feb 28, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Asymptotic Convergence of Thompson Sampling Nov 8, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Asynchronous Multi Agent Active Search Jun 25, 2020 Bayesian Optimization Compressive Sensing
— Unverified 00 Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems Jan 25, 2022 parameter estimation Thompson Sampling
— Unverified 00 A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning Mar 10, 2023 Thompson Sampling
— Unverified 00 Automatic Ensemble Learning for Online Influence Maximization Nov 25, 2019 Ensemble Learning Multi-Armed Bandits
— Unverified 00 AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning Apr 8, 2019 Bayesian Optimization Inductive Bias
— Unverified 00 Bag of Policies for Distributional Deep Exploration Aug 3, 2023 Atari Games Efficient Exploration
— Unverified 00 BanditCAT and AutoIRT: Machine Learning Approaches to Computerized Adaptive Testing and Item Calibration Oct 28, 2024 AutoML Thompson Sampling
— Unverified 00 Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control Sep 24, 2020 Change Point Detection Computational Efficiency
— Unverified 00 Bandit Convex Optimization: sqrtT Regret in One Dimension Feb 23, 2015 Thompson Sampling
— Unverified 00 Bandit Learning for Diversified Interactive Recommendation Jul 1, 2019 Bayesian Inference Diversity
— Unverified 00 Bandit Models of Human Behavior: Reward Processing in Mental Disorders Jun 7, 2017 Decision Making Thompson Sampling
— Unverified 00 Bandit Policies for Reliable Cellular Network Handovers in Extreme Mobility Oct 28, 2020 Thompson Sampling
— Unverified 00 Bandits Under The Influence (Extended Version) Sep 21, 2020 Recommendation Systems Thompson Sampling
— Unverified 00 Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization Jun 5, 2022 BIG-bench Machine Learning Evolutionary Algorithms
— Unverified 00 Batch Bayesian Optimization for Replicable Experimental Design Nov 2, 2023 AutoML Bayesian Optimization
— Unverified 00 Batched Thompson Sampling Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Batched Thompson Sampling for Multi-Armed Bandits Aug 15, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits Jun 20, 2024 Bayesian Inference Thompson Sampling
— Unverified 00 Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies Nov 16, 2017 Decision Making Thompson Sampling
— Unverified 00 Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal Health Program Oct 28, 2024 Matrix Completion Thompson Sampling
— Unverified 00 Bayesian decision-making under misspecified priors with applications to meta-learning Jul 3, 2021 Decision Making Meta-Learning
— Unverified 00 Bayesian-Guided Generation of Synthetic Microbiomes with Minimized Pathogenicity Apr 29, 2024 Bayesian Optimization Thompson Sampling
— Unverified 00 Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space Jun 5, 2023 Thompson Sampling
— Unverified 00 Bayesian learning of the optimal action-value function in a Markov decision process May 3, 2025 Decision Making Sequential Decision Making
— Unverified 00 Bayesian Mixture Modelling and Inference based Thompson Sampling in Monte-Carlo Tree Search Dec 1, 2013 Thompson Sampling
— Unverified 00 Bayesian Optimization-Based Beam Alignment for MmWave MIMO Communication Systems Jul 28, 2022 Bayesian Optimization Thompson Sampling
— Unverified 00 Bayesian Optimization with Inexact Acquisition: Is Random Grid Search Sufficient? Jun 13, 2025 Bayesian Optimization Thompson Sampling
— Unverified 00 Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation May 2, 2024 Bayesian Optimization Conversational Recommendation
— Unverified 00 Bayesian Quantile and Expectile Optimisation Jan 12, 2020 Bayesian Optimisation Gaussian Processes
— Unverified 00 BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems Nov 15, 2017 Deep Reinforcement Learning Efficient Exploration
— Unverified 00 BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems Aug 17, 2016 Deep Reinforcement Learning Efficient Exploration
— Unverified 00 Belief Flows of Robust Online Learning May 26, 2015 General Classification regression
— Unverified 00 Best Arm Identification in Batched Multi-armed Bandit Problems Dec 21, 2023 Marketing Thompson Sampling
— Unverified 00 Active RLHF via Best Policy Learning from Trajectory Preference Feedback Jan 31, 2025 Thompson Sampling
— Unverified 00 Better Optimism By Bayes: Adaptive Planning with Rich Models Feb 9, 2014 Model-based Reinforcement Learning Reinforcement Learning
— Unverified 00 Blind Exploration and Exploitation of Stochastic Experts Apr 2, 2021 Thompson Sampling
— Unverified 00 Bootstrapped Thompson Sampling and Deep Exploration Jul 1, 2015 reinforcement-learning Reinforcement Learning
— Unverified 00 BOTS: Batch Bayesian Optimization of Extended Thompson Sampling for Severely Episode-Limited RL Settings Nov 30, 2024 Bayesian Optimization Policy Gradient Methods
— Unverified 00 Calibrated Fairness in Bandits Jul 6, 2017 Decision Making Fairness
— Unverified 00 Causal Bandits without prior knowledge using separating sets Sep 16, 2020 Causal Discovery Decision Making
— Unverified 00 Chained Information-Theoretic bounds and Tight Regret Rate for Linear Bandit Problems Mar 5, 2024 Thompson Sampling
— Unverified 00 Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments Mar 22, 2021 Thompson Sampling
— Unverified 00 Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space Exploration Tool for FPGA High-Level Synthesis Jul 3, 2022 Active Learning Descriptive
— Unverified 00 Code Repair with LLMs gives an Exploration-Exploitation Tradeoff May 26, 2024 Code Repair Language Modeling
— Unverified 00 Bayesian Analysis of Combinatorial Gaussian Process Bandits Dec 20, 2023 Bayesian Inference Informativeness
— Unverified 00 Combinatorial Multi-armed Bandits: Arm Selection via Group Testing Oct 14, 2024 Multi-Armed Bandits parameter estimation
— Unverified 00 Combinatorial Multi-armed Bandit with Probabilistically Triggered Arms: A Case with Bounded Regret Jul 24, 2017 Movie Recommendation Thompson Sampling
— Unverified 00