A Multi-Armed Bandit to Smartly Select a Training Set from Big Medical Data May 23, 2017 Thompson Sampling
Adaptive Combinatorial Allocation Nov 4, 2020 Thompson Sampling
Automatic Ensemble Learning for Online Influence Maximization Nov 25, 2019 Ensemble Learning Multi-Armed Bandits
AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning Apr 8, 2019 Bayesian Optimization Inductive Bias
Bag of Policies for Distributional Deep Exploration Aug 3, 2023 Atari Games Efficient Exploration
BanditCAT and AutoIRT: Machine Learning Approaches to Computerized Adaptive Testing and Item Calibration Oct 28, 2024 AutoML Thompson Sampling
Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control Sep 24, 2020 Change Point Detection Computational Efficiency
Bandit Convex Optimization: √T Regret in One Dimension Feb 23, 2015 Thompson Sampling
Bandit Learning for Diversified Interactive Recommendation Jul 1, 2019 Bayesian Inference Diversity
Adaptive Rate of Convergence of Thompson Sampling for Gaussian Process Optimization May 18, 2017 global-optimization Thompson Sampling
Bandit Models of Human Behavior: Reward Processing in Mental Disorders Jun 7, 2017 Decision Making Thompson Sampling
Bandit Policies for Reliable Cellular Network Handovers in Extreme Mobility Oct 28, 2020 Thompson Sampling
Bandits Under The Influence (Extended Version) Sep 21, 2020 Recommendation Systems Thompson Sampling
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization Jun 5, 2022 BIG-bench Machine Learning Evolutionary Algorithms
Batch Bayesian Optimization for Replicable Experimental Design Nov 2, 2023 AutoML Bayesian Optimization
Adaptive Sensor Placement for Continuous Spaces May 16, 2019 Thompson Sampling
Batched Thompson Sampling Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
Batched Thompson Sampling for Multi-Armed Bandits Aug 15, 2021 Multi-Armed Bandits Thompson Sampling
An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits Sep 5, 2019 Decision Making Recommendation Systems
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits Jun 20, 2024 Bayesian Inference Thompson Sampling
An Efficient Algorithm For Generalized Linear Bandit: Online Stochastic Gradient Descent and Thompson Sampling Jun 7, 2020 Thompson Sampling
Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies Nov 16, 2017 Decision Making Thompson Sampling
Code Repair with LLMs gives an Exploration-Exploitation Tradeoff May 26, 2024 Code Repair Language Modeling
Bayesian decision-making under misspecified priors with applications to meta-learning Jul 3, 2021 Decision Making Meta-Learning
Bayesian-Guided Generation of Synthetic Microbiomes with Minimized Pathogenicity Apr 29, 2024 Bayesian Optimization Thompson Sampling
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space Jun 5, 2023 Thompson Sampling
Adaptive Operator Selection Based on Dynamic Thompson Sampling for MOEA/D Apr 22, 2020 Thompson Sampling
Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits Jul 19, 2018 Multi-Armed Bandits Thompson Sampling
A Quantile-based Approach for Hyperparameter Transfer Learning Sep 30, 2019 Bayesian Optimization Hyperparameter Optimization
Bayesian Analysis of Combinatorial Gaussian Process Bandits Dec 20, 2023 Bayesian Inference Informativeness
Combinatorial Multi-armed Bandits: Arm Selection via Group Testing Oct 14, 2024 Multi-Armed Bandits parameter estimation
A Nonparametric Contextual Bandit with Arm-level Eligibility Control for Customer Service Routing Sep 8, 2022 Thompson Sampling
An Online Learning Framework for Energy-Efficient Navigation of Electric Vehicles Mar 3, 2020 Navigate Thompson Sampling
Adaptive Model Selection Framework: An Application to Airline Pricing May 21, 2019 Model Selection Thompson Sampling
Belief Flows of Robust Online Learning May 26, 2015 General Classification regression
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems Aug 17, 2016 Deep Reinforcement Learning Efficient Exploration
An Information-Theoretic Analysis of Thompson Sampling with Infinite Action Spaces Feb 4, 2025 Thompson Sampling
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems Nov 15, 2017 Deep Reinforcement Learning Efficient Exploration
Best Arm Identification in Batched Multi-armed Bandit Problems Dec 21, 2023 Marketing Thompson Sampling
Active RLHF via Best Policy Learning from Trajectory Preference Feedback Jan 31, 2025 Thompson Sampling
Better Optimism By Bayes: Adaptive Planning with Rich Models Feb 9, 2014 Model-based Reinforcement Learning Reinforcement Learning
Blind Exploration and Exploitation of Stochastic Experts Apr 2, 2021 Thompson Sampling
Bootstrapped Thompson Sampling and Deep Exploration Jul 1, 2015 reinforcement-learning Reinforcement Learning
BOTS: Batch Bayesian Optimization of Extended Thompson Sampling for Severely Episode-Limited RL Settings Nov 30, 2024 Bayesian Optimization Policy Gradient Methods
Calibrated Fairness in Bandits Jul 6, 2017 Decision Making Fairness
A Note on Information-Directed Sampling and Thompson Sampling Mar 24, 2015 Thompson Sampling
An Unbiased Data Collection and Content Exploitation/Exploration Strategy for Personalization Apr 12, 2016 Recommendation Systems Thompson Sampling
Causal Bandits without prior knowledge using separating sets Sep 16, 2020 Causal Discovery Decision Making
Chained Information-Theoretic bounds and Tight Regret Rate for Linear Bandit Problems Mar 5, 2024 Thompson Sampling
Bayesian Quantile and Expectile Optimisation Jan 12, 2020 Bayesian Optimisation Gaussian Processes