| Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search | May 20, 2024 | ClusteringSequential Decision Making | CodeCode Available | 0 |
| A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback | May 20, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System | May 19, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI | May 13, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments | May 13, 2024 | Decision MakingDiagnostic | —Unverified | 0 |
| Learning Planning Abstractions from Language | May 6, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Enhancing Q-Learning with Large Language Model Heuristics | May 6, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows | May 6, 2024 | Causal Inferencecounterfactual | —Unverified | 0 |
| MEXGEN: An Effective and Efficient Information Gain Approximation for Information Gathering Path Planning | May 4, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery | May 3, 2024 | Decision MakingInterpretable Machine Learning | —Unverified | 0 |
| Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | May 2, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks | Apr 29, 2024 | Bayesian InferenceGaussian Processes | —Unverified | 0 |
| Q-learning with temporal memory to navigate turbulence | Apr 26, 2024 | Decision MakingNavigate | —Unverified | 0 |
| Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment | Apr 23, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement Learning | Apr 16, 2024 | Attributecounterfactual | CodeCode Available | 0 |
| Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation | Apr 13, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery | Apr 10, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control | Apr 10, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Apr 10, 2024 | Decision MakingMeta Reinforcement Learning | CodeCode Available | 0 |
| Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection | Apr 10, 2024 | Out-of-Distribution DetectionOut of Distribution (OOD) Detection | CodeCode Available | 0 |
| Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus Erythematosus | Apr 9, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Regularized Conditional Diffusion Model for Multi-Task Preference Alignment | Apr 7, 2024 | D4RLDecision Making | —Unverified | 0 |
| Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks | Apr 3, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Multi-granular Adversarial Attacks against Black-box Neural Ranking Models | Apr 2, 2024 | Adversarial AttackDecision Making | —Unverified | 0 |
| Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems | Mar 26, 2024 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Mixed-Initiative Human-Robot Teaming under Suboptimality with Online Bayesian Adaptation | Mar 24, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Continual Vision-and-Language Navigation | Mar 22, 2024 | Continual LearningNavigate | —Unverified | 0 |
| Sequential Decision-Making for Inline Text Autocomplete | Mar 21, 2024 | Decision MakingLanguage Modelling | —Unverified | 0 |
| Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion | Mar 19, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Fast Value Tracking for Deep Reinforcement Learning | Mar 19, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Supervised Fine-Tuning as Inverse Reinforcement Learning | Mar 18, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Mar 18, 2024 | Decision MakingQ-Learning | —Unverified | 0 |
| Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | Mar 16, 2024 | Decision MakingEdge-computing | —Unverified | 0 |
| Regret Minimization via Saddle Point Optimization | Mar 15, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents | Mar 13, 2024 | Decision MakingIn-Context Learning | —Unverified | 0 |
| CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Mar 11, 2024 | Recommendation SystemsReinforcement Learning (RL) | —Unverified | 0 |
| LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem | Mar 10, 2024 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | Mar 8, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Cooperative Bayesian Optimization for Imperfect Agents | Mar 7, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation | Mar 6, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds | Mar 1, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games | Mar 1, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Feb 26, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reward Design for Justifiable Sequential Decision-Making | Feb 24, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Information-Theoretic Safe Bayesian Optimization | Feb 23, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay | Feb 22, 2024 | Autonomous RacingDecision Making | —Unverified | 0 |
| On the Performance of Empirical Risk Minimization with Smoothed Data | Feb 22, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformers | Feb 20, 2024 | Decision MakingDecoder | —Unverified | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RLDecision Making | —Unverified | 0 |