| Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Jul 20, 2024 | All, Autonomous Driving | Unverified | 0 |
| Is Conditional Generative Modeling all you need for Decision-Making? | Nov 28, 2022 | All, Decision Making | Unverified | 0 |
| DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation | May 24, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon | Sep 28, 2020 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control | Feb 14, 2025 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| A Survey on Explainable Deep Reinforcement Learning | Feb 8, 2025 | Adversarial Robustness, Decision Making | Unverified | 0 |
| Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning | Apr 9, 2021 | Collision Avoidance, Decision Making | Unverified | 0 |
| JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents | Aug 28, 2022 | Action Generation, Common Sense Reasoning | Unverified | 0 |
| Joint AP Probing and Scheduling: A Contextual Bandit Approach | Aug 6, 2021 | Decision Making, Scheduling | Unverified | 0 |
| Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | Oct 17, 2023 | continuous-control, Continuous Control | Unverified | 0 |
| Knowledge-Based Sequential Decision-Making Under Uncertainty | May 16, 2019 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning | May 5, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Deciding What to Learn: A Rate-Distortion Approach | Jan 15, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning | Jun 15, 2023 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management | Sep 5, 2024 | Benchmarking, Computational Efficiency | Unverified | 0 |
| Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network | Oct 16, 2021 | Behavioural cloning, Decision Making | Unverified | 0 |
| Large Sequence Models for Sequential Decision-Making: A Survey | Jun 24, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning | Jul 15, 2024 | continuous-control, Continuous Control | Unverified | 0 |
| Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning | Mar 15, 2023 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Latent Variable Algorithms for Multimodal Learning and Sensor Fusion | Apr 23, 2019 | Activity Recognition, Decision Making | Unverified | 0 |
| LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization | Nov 18, 2020 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Information-Theoretic Safe Bayesian Optimization | Feb 23, 2024 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Actor-Critic Algorithms for Risk-Sensitive MDPs | Dec 1, 2013 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Learning Curricula in Open-Ended Worlds | Dec 3, 2023 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |