| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RLMeta Reinforcement Learning | —Unverified | 0 |
| Continuous Control for Searching and Planning with a Learned Model | Jun 12, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization | Apr 4, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Continuous Mean-Zero Disagreement-Regularized Imitation Learning (CMZ-DRIL) | Mar 2, 2024 | Imitation LearningMuJoCo | —Unverified | 0 |
| Continuous Neural Algorithmic Planners | Nov 29, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling | Nov 11, 2022 | MuJoCoNavigate | —Unverified | 0 |
| Cooperative Heterogeneous Deep Reinforcement Learning | Nov 2, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Cooperative Multi-Agent Deep Reinforcement Learning in Content Ranking Optimization | Aug 8, 2024 | Deep Reinforcement LearningInformation Retrieval | —Unverified | 0 |
| Cross-Domain Imitation Learning with a Dual Structure | Jun 2, 2020 | Imitation LearningMuJoCo | —Unverified | 0 |
| CrossNorm: On Normalization for Off-Policy Reinforcement Learning | Sep 25, 2019 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Data Valuation for Offline Reinforcement Learning | May 19, 2022 | Data ValuationDeep Reinforcement Learning | —Unverified | 0 |
| DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning | Jun 26, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies | Jun 12, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Decorrelated Double Q-learning | Jun 12, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep exploration by novelty-pursuit with maximum state entropy | Sep 25, 2019 | Efficient ExplorationMuJoCo | —Unverified | 0 |
| Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks | Sep 20, 2017 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| DeepSafeMPC: Deep Learning-Based Model Predictive Control for Safe Multi-Agent Reinforcement Learning | Mar 11, 2024 | Model Predictive ControlMuJoCo | —Unverified | 0 |
| DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays | Jun 5, 2024 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety | May 8, 2023 | MuJoCo | —Unverified | 0 |
| Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback | May 13, 2023 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study | Jul 8, 2025 | MuJoCoRecommendation Systems | —Unverified | 0 |
| DexDLO: Learning Goal-Conditioned Dexterous Policy for Dynamic Manipulation of Deformable Linear Objects | Dec 23, 2023 | MuJoCoPosition | —Unverified | 0 |
| DIDA: Denoised Imitation Learning based on Domain Adaptation | Apr 4, 2024 | Domain AdaptationImitation Learning | —Unverified | 0 |
| Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction | May 27, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| DisTop: Discovering a Topological representation to learn diverse and rewarding skills | Jun 6, 2021 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Distributional Decision Transformer for Hindsight Information Matching | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Distributionally Robust Statistical Verification with Imprecise Neural Networks | Aug 28, 2023 | Active LearningMuJoCo | —Unverified | 0 |
| Diverse Imitation Learning via Self-OrganizingGenerative Models | Sep 29, 2021 | Imitation LearningMuJoCo | —Unverified | 0 |
| DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes | Aug 1, 2023 | Computational EfficiencyGrasp Generation | —Unverified | 0 |
| Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation | Apr 14, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| DropoutDAgger: A Bayesian Approach to Safe Imitation Learning | Sep 18, 2017 | Imitation LearningMuJoCo | —Unverified | 0 |
| Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization | Sep 1, 2022 | Bayesian InferenceKnowledge Distillation | —Unverified | 0 |
| Effects of sparse rewards of different magnitudes in the speed of learning of model-based actor critic methods | Jan 18, 2020 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Efficient Diversity-based Experience Replay for Deep Reinforcement Learning | Oct 27, 2024 | Atari GamesDecision Making | —Unverified | 0 |
| Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration | Sep 27, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling | Oct 7, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| ELSIM: End-to-end learning of reusable skills through intrinsic motivation | Jun 23, 2020 | Developmental LearningMuJoCo | —Unverified | 0 |
| Enhanced DACER Algorithm with High Diffusion Efficiency | May 29, 2025 | DenoisingImitation Learning | —Unverified | 0 |
| EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning | Jul 22, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| Entropy Augmented Reinforcement Learning | Aug 19, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Episodic Reinforcement Learning with Expanded State-reward Space | Jan 19, 2024 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning | Aug 17, 2024 | Density EstimationImitation Learning | —Unverified | 0 |
| Markov flow policy -- deep MC | May 1, 2024 | MuJoCo | —Unverified | 0 |
| Masked Imitation Learning: Discovering Environment-Invariant Modalities in Multimodal Demonstrations | Sep 16, 2022 | Decision MakingImitation Learning | —Unverified | 0 |
| Maximizing Ensemble Diversity in Deep Reinforcement Learning | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 |
| Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation | Jul 25, 2024 | MuJoCo | —Unverified | 0 |
| Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees | Oct 4, 2022 | counterfactualImitation Learning | —Unverified | 0 |
| Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning | Jun 15, 2022 | Autonomous Drivingcontinuous-control | —Unverified | 0 |
| Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning | May 29, 2025 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents | Jun 18, 2024 | continuous-controlContinuous Control | —Unverified | 0 |