DRL-based Joint Resource Scheduling of eMBB and URLLC in O-RAN Jul 16, 2024 Decision Making Deep Reinforcement Learning
— Unverified 0Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits Jun 20, 2024 Bayesian Inference Thompson Sampling
— Unverified 0Preferential Multi-Objective Bayesian Optimization Jun 20, 2024 Autonomous Driving Bayesian Optimization
— Unverified 0Joint User Association and Pairing in Multi-UAV-Assisted NOMA Networks: A Decaying-Epsilon Thompson Sampling Framework Jun 20, 2024 Thompson Sampling
— Unverified 0Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents Jun 18, 2024 continuous-control Continuous Control
— Unverified 0More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling Jun 18, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions Jun 16, 2024 Multi-Armed Bandits Policy Gradient Methods
— Unverified 0Graph Neural Thompson Sampling Jun 15, 2024 Decision Making Graph Neural Network
— Unverified 0A Federated Online Restless Bandit Framework for Cooperative Resource Allocation Jun 12, 2024 Federated Learning Multi-Armed Bandits
— Unverified 0DISCO: An End-to-End Bandit Framework for Personalised Discount Allocation Jun 10, 2024 Thompson Sampling
— Unverified 0Two-Stage Resource Allocation in Reconfigurable Intelligent Surface Assisted Hybrid Networks via Multi-Player Bandits Jun 9, 2024 Thompson Sampling
— Unverified 0Adaptively Learning to Select-Rank in Online Platforms Jun 7, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism Jun 6, 2024 Thompson Sampling
— Unverified 0Approximate Thompson Sampling for Learning Linear Quadratic Regulators with O(T) Regret May 29, 2024 Thompson Sampling
— Unverified 0Posterior Sampling via Autoregressive Generation May 29, 2024 Articles Decision Making
— Unverified 0Cost-efficient Knowledge-based Question Answering with Large Language Models May 27, 2024 Knowledge Graphs Model Selection
— Unverified 0On Bits and Bandits: Quantifying the Regret-Information Trade-off May 26, 2024 Decision Making Question Answering
Code Code Available 0Code Repair with LLMs gives an Exploration-Exploitation Tradeoff May 26, 2024 Code Repair Language Modeling
— Unverified 0Indexed Minimum Empirical Divergence-Based Algorithms for Linear Bandits May 24, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0No Algorithmic Collusion in Two-Player Blindfolded Game with Thompson Sampling May 23, 2024 Thompson Sampling
— Unverified 0Understanding the Training and Generalization of Pretrained Transformer for Sequential Decision Making May 23, 2024 Decision Making Sequential Decision Making
— Unverified 0Smart Routing with Precise Link Estimation: DSEE-Based Anypath Routing for Reliable Wireless Networking May 16, 2024 Thompson Sampling
— Unverified 0Analyzing and Enhancing Queue Sampling for Energy-Efficient Remote Control of Bandits May 15, 2024 Autonomous Vehicles Thompson Sampling
— Unverified 0Thompson Sampling for Infinite-Horizon Discounted Decision Processes May 14, 2024 Thompson Sampling
— Unverified 0Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit May 7, 2024 Federated Learning Thompson Sampling
Code Code Available 0Efficient and Adaptive Posterior Sampling Algorithms for Bandits May 2, 2024 Thompson Sampling
— Unverified 0Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation May 2, 2024 Bayesian Optimization Conversational Recommendation
— Unverified 0Bayesian-Guided Generation of Synthetic Microbiomes with Minimized Pathogenicity Apr 29, 2024 Bayesian Optimization Thompson Sampling
— Unverified 0Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning Apr 16, 2024 Federated Learning Multi-agent Reinforcement Learning
— Unverified 0Online Learning of Decision Trees with Thompson Sampling Apr 9, 2024 Interpretable Machine Learning Thompson Sampling
Code Code Available 0Feel-Good Thompson Sampling for Contextual Dueling Bandits Apr 9, 2024 Decision Making Multi-Armed Bandits
— Unverified 0A Reinforcement Learning based Reset Policy for CDCL SAT Solvers Apr 4, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0On the Importance of Uncertainty in Decision-Making with Large Language Models Apr 3, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Meta Learning in Bandits within Shared Affine Subspaces Mar 31, 2024 Meta-Learning Thompson Sampling
— Unverified 0A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food Mar 15, 2024 Scheduling Thompson Sampling
— Unverified 0ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment Mar 11, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Cramming Contextual Bandits for On-policy Statistical Evaluation Mar 11, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0TS-RSR: A provably efficient approach for batch Bayesian Optimization Mar 7, 2024 Bayesian Optimization Thompson Sampling
— Unverified 0Chained Information-Theoretic bounds and Tight Regret Rate for Linear Bandit Problems Mar 5, 2024 Thompson Sampling
— Unverified 0Epsilon-Greedy Thompson Sampling to Bayesian Optimization Mar 1, 2024 Bayesian Optimization Cantilever Beam
— Unverified 0Influencing Bandits: Arm Selection for Preference Shaping Feb 29, 2024 Recommendation Systems Thompson Sampling
— Unverified 0Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits Feb 23, 2024 Thompson Sampling
— Unverified 0Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identification Feb 16, 2024 Thompson Sampling
— Unverified 0Thompson Sampling in Partially Observable Contextual Bandits Feb 15, 2024 Decision Making Decision Making Under Uncertainty
— Unverified 0Diffusion Models Meet Contextual Bandits with Large Action Spaces Feb 15, 2024 Efficient Exploration Multi-Armed Bandits
— Unverified 0Tree Ensembles for Contextual Bandits Feb 10, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits Feb 7, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Optimistic Thompson Sampling for No-Regret Learning in Unknown Games Feb 7, 2024 Decision Making Thompson Sampling
— Unverified 0Efficient Exploration for LLMs Feb 1, 2024 Efficient Exploration Thompson Sampling
— Unverified 0Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo Jan 22, 2024 Thompson Sampling
Code Code Available 0