| Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Jul 20, 2024 | AllAutonomous Driving | —Unverified | 0 |
| Is Conditional Generative Modeling all you need for Decision-Making? | Nov 28, 2022 | AllDecision Making | —Unverified | 0 |
| DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation | May 24, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon | Sep 28, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection | Apr 25, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Data-Driven Online Model Selection With Regret Guarantees | Jun 5, 2023 | Decision Makingmodel | —Unverified | 0 |
| Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning | Apr 9, 2021 | Collision AvoidanceDecision Making | —Unverified | 0 |
| JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents | Aug 28, 2022 | Action GenerationCommon Sense Reasoning | —Unverified | 0 |
| Joint AP Probing and Scheduling: A Contextual Bandit Approach | Aug 6, 2021 | Decision MakingScheduling | —Unverified | 0 |
| Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | Oct 17, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Knowledge-Based Sequential Decision-Making Under Uncertainty | May 16, 2019 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning | May 5, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Deciding What to Learn: A Rate-Distortion Approach | Jan 15, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning | Jun 15, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents | Oct 21, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach | Dec 5, 2024 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| Large Sequence Models for Sequential Decision-Making: A Survey | Jun 24, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning | Jul 15, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning | Mar 15, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Latent Variable Algorithms for Multimodal Learning and Sensor Fusion | Apr 23, 2019 | Activity RecognitionDecision Making | —Unverified | 0 |
| LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization | Nov 18, 2020 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | May 4, 2025 | Causal DiscoveryDecision Making | —Unverified | 0 |
| A Survey on Interpretable Reinforcement Learning | Dec 24, 2021 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Learning Universal Policies via Text-Guided Video Generation | Jan 31, 2023 | Decision MakingImage Generation | —Unverified | 0 |
| A Survey on Explainable Deep Reinforcement Learning | Feb 8, 2025 | Adversarial RobustnessDecision Making | —Unverified | 0 |
| InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management | Sep 5, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Automaton-Based Representations of Task Knowledge from Generative Language Models | Dec 4, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Learning-Based UAV Trajectory Optimization with Collision Avoidance and Connectivity Constraints | Apr 3, 2021 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect | Jun 18, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network | Oct 16, 2021 | Behavioural cloningDecision Making | —Unverified | 0 |
| Causal Explanations for Sequential Decision Making Under Uncertainty | May 30, 2022 | Causal InferenceDecision Making | —Unverified | 0 |
| Learning Curricula in Open-Ended Worlds | Dec 3, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Bayesian Estimation for Dynamic Treatment Regimes with a Long Follow-up Time | Sep 20, 2021 | Decision Makingregression | —Unverified | 0 |
| Learning to Make Decisions via Submodular Regularization | Jan 1, 2021 | Active LearningBayesian Optimization | —Unverified | 0 |
| Information-Theoretic Safe Bayesian Optimization | Feb 23, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Actor-Critic Algorithms for Risk-Sensitive MDPs | Dec 1, 2013 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Learning Efficient Representations for Reinforcement Learning | Aug 28, 2015 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction | Mar 3, 2017 | Decision MakingDependency Parsing | —Unverified | 0 |
| Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control | Feb 14, 2025 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning | May 21, 2017 | BenchmarkingDecision Making | —Unverified | 0 |
| Learning Functionally Decomposed Hierarchies for Continuous Navigation Tasks | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning Markov models via low-rank optimization | Jun 28, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning | May 21, 2021 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Learning Mobile Robot Navigation in the Dense Crowd with Deep Reinforcement Learning | Dec 14, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking | Oct 21, 2020 | Decision MakingFraud Detection | —Unverified | 0 |
| Information Directed Sampling for Linear Partial Monitoring | Feb 25, 2020 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Automated Cyber Defence: A Review | Mar 8, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Automated Reinforcement Learning: An Overview | Jan 13, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Information Avoidance and Overvaluation in Sequential Decision Making under Epistemic Constraints | Jun 9, 2021 | Decision MakingManagement | —Unverified | 0 |