Smart Routing with Precise Link Estimation: DSEE-Based Anypath Routing for Reliable Wireless Networking May 16, 2024 Thompson Sampling
— Unverified 0Analyzing and Enhancing Queue Sampling for Energy-Efficient Remote Control of Bandits May 15, 2024 Autonomous Vehicles Thompson Sampling
— Unverified 0Thompson Sampling for Infinite-Horizon Discounted Decision Processes May 14, 2024 Thompson Sampling
— Unverified 0Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit May 7, 2024 Federated Learning Thompson Sampling
Code Code Available 0Efficient and Adaptive Posterior Sampling Algorithms for Bandits May 2, 2024 Thompson Sampling
— Unverified 0Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation May 2, 2024 Bayesian Optimization Conversational Recommendation
— Unverified 0Bayesian-Guided Generation of Synthetic Microbiomes with Minimized Pathogenicity Apr 29, 2024 Bayesian Optimization Thompson Sampling
— Unverified 0Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning Apr 16, 2024 Federated Learning Multi-agent Reinforcement Learning
— Unverified 0Online Learning of Decision Trees with Thompson Sampling Apr 9, 2024 Interpretable Machine Learning Thompson Sampling
Code Code Available 0Feel-Good Thompson Sampling for Contextual Dueling Bandits Apr 9, 2024 Decision Making Multi-Armed Bandits
— Unverified 0A Reinforcement Learning based Reset Policy for CDCL SAT Solvers Apr 4, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0On the Importance of Uncertainty in Decision-Making with Large Language Models Apr 3, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Meta Learning in Bandits within Shared Affine Subspaces Mar 31, 2024 Meta-Learning Thompson Sampling
— Unverified 0A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food Mar 15, 2024 Scheduling Thompson Sampling
— Unverified 0Cramming Contextual Bandits for On-policy Statistical Evaluation Mar 11, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment Mar 11, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0TS-RSR: A provably efficient approach for batch Bayesian Optimization Mar 7, 2024 Bayesian Optimization Thompson Sampling
— Unverified 0Chained Information-Theoretic bounds and Tight Regret Rate for Linear Bandit Problems Mar 5, 2024 Thompson Sampling
— Unverified 0Epsilon-Greedy Thompson Sampling to Bayesian Optimization Mar 1, 2024 Bayesian Optimization Cantilever Beam
— Unverified 0Influencing Bandits: Arm Selection for Preference Shaping Feb 29, 2024 Recommendation Systems Thompson Sampling
— Unverified 0Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits Feb 23, 2024 Thompson Sampling
— Unverified 0Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identification Feb 16, 2024 Thompson Sampling
— Unverified 0Thompson Sampling in Partially Observable Contextual Bandits Feb 15, 2024 Decision Making Decision Making Under Uncertainty
— Unverified 0Diffusion Models Meet Contextual Bandits with Large Action Spaces Feb 15, 2024 Efficient Exploration Multi-Armed Bandits
— Unverified 0Tree Ensembles for Contextual Bandits Feb 10, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Optimistic Thompson Sampling for No-Regret Learning in Unknown Games Feb 7, 2024 Decision Making Thompson Sampling
— Unverified 0Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits Feb 7, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Efficient Exploration for LLMs Feb 1, 2024 Efficient Exploration Thompson Sampling
— Unverified 0Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo Jan 22, 2024 Thompson Sampling
Code Code Available 0Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis Jan 21, 2024 Thompson Sampling
— Unverified 0Model-Free Approximate Bayesian Learning for Large-Scale Conversion Funnel Optimization Jan 12, 2024 Decision Making Marketing
— Unverified 0Decentralized Multi-Agent Active Search and Tracking when Targets Outnumber Agents Jan 6, 2024 Decision Making Thompson Sampling
— Unverified 0Improving sample efficiency of high dimensional Bayesian optimization with MCMC Jan 5, 2024 Bayesian Optimization Thompson Sampling
— Unverified 0Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search Dec 28, 2023 Multi-Agent Path Finding Thompson Sampling
Code Code Available 1Zero-Inflated Bandits Dec 25, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs Dec 24, 2023 Computational Efficiency Thompson Sampling
Code Code Available 0Best Arm Identification in Batched Multi-armed Bandit Problems Dec 21, 2023 Marketing Thompson Sampling
— Unverified 0Bayesian Analysis of Combinatorial Gaussian Process Bandits Dec 20, 2023 Bayesian Inference Informativeness
— Unverified 0RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Sample-based Dynamic Hierarchical Transformer with Layer and Head Flexibility via Contextual Bandit Dec 5, 2023 Thompson Sampling
— Unverified 0The Sliding Regret in Stochastic Bandits: Discriminating Index and Randomized Policies Nov 30, 2023 Thompson Sampling
— Unverified 0Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study Nov 24, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Probabilistic Inference in Reinforcement Learning Done Right Nov 22, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0A Distributed Neural Linear Thompson Sampling Framework to Achieve URLLC in Industrial IoT Nov 21, 2023 Scheduling Thompson Sampling
— Unverified 0Adaptive Interventions with User-Defined Goals for Health Behavior Change Nov 16, 2023 Thompson Sampling
Code Code Available 0Exploration via linearly perturbed loss minimisation Nov 13, 2023 Thompson Sampling
— Unverified 0Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds Nov 7, 2023 Bayesian Optimization Thompson Sampling
— Unverified 0Batch Bayesian Optimization for Replicable Experimental Design Nov 2, 2023 AutoML Bayesian Optimization
— Unverified 0Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning Oct 30, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Dual-Directed Algorithm Design for Efficient Pure Exploration Oct 30, 2023 Thompson Sampling
— Unverified 0