Sequential Best-Arm Identification with Application to Brain-Computer Interface May 17, 2023 Brain Computer Interface EEG
— Unverified 0Thompson Sampling for Parameterized Markov Decision Processes with Uninformative Actions May 13, 2023 Bayesian Inference Thompson Sampling
— Unverified 0Trajectory-oriented optimization of stochastic epidemiological models May 6, 2023 Thompson Sampling
Code Code Available 0An improved regret analysis for UCB-N and TS-N May 6, 2023 LEMMA Thompson Sampling
— Unverified 0Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards Apr 28, 2023 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards Apr 26, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Efficiently Tackling Million-Dimensional Multiobjective Problems: A Direction Sampling and Fine-Tuning Approach Apr 8, 2023 Multiobjective Optimization Recommendation Systems
— Unverified 0Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms Apr 6, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0GUTS: Generalized Uncertainty-Aware Thompson Sampling for Multi-Agent Active Search Apr 4, 2023 All Disaster Response
— Unverified 0Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches Mar 21, 2023 Benchmarking Thompson Sampling
— Unverified 0Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling Mar 16, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning Mar 10, 2023 Thompson Sampling
— Unverified 0A General Recipe for the Analysis of Randomized Multi-Armed Bandit Algorithms Mar 10, 2023 Thompson Sampling
— Unverified 0Thompson Sampling for Linear Bandit Problems with Normal-Gamma Priors Mar 6, 2023 Thompson Sampling
— Unverified 0The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models Feb 28, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0When Combinatorial Thompson Sampling meets Approximation Regret Feb 22, 2023 Thompson Sampling
— Unverified 0Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits Feb 18, 2023 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 0A Bandit Approach to Online Pricing for Heterogeneous Edge Resource Allocation Feb 14, 2023 Edge-computing Thompson Sampling
— Unverified 0Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration Feb 8, 2023 Bayesian Inference Thompson Sampling
— Unverified 0Leveraging Demonstrations to Improve Online Learning: Quality Matters Feb 7, 2023 Thompson Sampling
— Unverified 0Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits Feb 3, 2023 Thompson Sampling
— Unverified 0Two-sided Competing Matching Recommendation Markets With Quota and Complementary Preferences Constraints Jan 24, 2023 Thompson Sampling
Code Code Available 0Differentially Private Online Bayesian Estimation With Adaptive Truncation Jan 19, 2023 Privacy Preserving Sensitivity
Code Code Available 0A Combinatorial Semi-Bandit Approach to Charging Station Selection for Electric Vehicles Jan 17, 2023 Combinatorial Optimization Thompson Sampling
— Unverified 0Thompson Sampling with Diffusion Generative Prior Jan 12, 2023 Decision Making Denoising
— Unverified 0Reinforcement Learning in Credit Scoring and Underwriting Dec 15, 2022 Decision Making Efficient Exploration
— Unverified 0Neural Bandits for Data Mining: Searching for Dangerous Polypharmacy Dec 10, 2022 Thompson Sampling
Code Code Available 0Online Learning-based Waveform Selection for Improved Vehicle Recognition in Automotive Radar Dec 1, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning Nov 23, 2022 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 0Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits Nov 11, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Atlas: Automate Online Service Configuration in Network Slicing Oct 30, 2022 Bayesian Optimization Safe Exploration
Code Code Available 0Optimizing Pessimism in Dynamic Treatment Regimes: A Bayesian Learning Approach Oct 26, 2022 Thompson Sampling Variational Inference
Code Code Available 0Meta Learning of Interface Conditions for Multi-Domain Physics-Informed Neural Networks Oct 23, 2022 Meta-Learning Thompson Sampling
— Unverified 0The Typical Behavior of Bandit Algorithms Oct 11, 2022 Thompson Sampling
— Unverified 0Deep Active Ensemble Sampling For Image Classification Oct 11, 2022 Active Learning Classification
— Unverified 0Cost Aware Asynchronous Multi-Agent Active Search Oct 5, 2022 Decision Making Thompson Sampling
— Unverified 0Thompson Sampling with Virtual Helping Agents Sep 16, 2022 Decision Making Sequential Decision Making
— Unverified 0Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits Sep 15, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Nonparametric Contextual Bandit with Arm-level Eligibility Control for Customer Service Routing Sep 8, 2022 Thompson Sampling
— Unverified 0Sample Efficient Learning of Factored Embeddings of Tensor Fields Sep 1, 2022 Recommendation Systems Thompson Sampling
— Unverified 0Causal Bandits for Linear Structural Equation Models Aug 26, 2022 Thompson Sampling
Code Code Available 0Dynamic collaborative filtering Thompson Sampling for cross-domain advertisements recommendation Aug 25, 2022 Collaborative Filtering Recommendation Systems
— Unverified 0A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning Aug 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Non-Stationary Dynamic Pricing Via Actor-Critic Information-Directed Pricing Aug 19, 2022 Thompson Sampling
— Unverified 0Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits Aug 10, 2022 Management Multi-Armed Bandits
— Unverified 0Using Adaptive Experiments to Rapidly Help Students Aug 10, 2022 Thompson Sampling
— Unverified 0Bayesian Optimization-Based Beam Alignment for MmWave MIMO Communication Systems Jul 28, 2022 Bayesian Optimization Thompson Sampling
— Unverified 0SPRT-based Efficient Best Arm Identification in Stochastic Bandits Jul 22, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space Exploration Tool for FPGA High-Level Synthesis Jul 3, 2022 Active Learning Descriptive
— Unverified 0Ranking In Generalized Linear Bandits Jun 30, 2022 Diversity Multi-Armed Bandits
Code Code Available 0