| How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization | Apr 29, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| S^2AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic | May 2, 2024 | MuJoCo, Variational Inference | Code Available | 1 |
| Contrastive Variational Reinforcement Learning for Complex Observations | Aug 6, 2020 | Atari Games, Continuous Control | Code Available | 1 |
| Doubly Mild Generalization for Offline Reinforcement Learning | Nov 12, 2024 | MuJoCo, Offline RL | Code Available | 1 |
| EDGE: Explaining Deep Reinforcement Learning Policies | Dec 1, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction | May 27, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Coagent Networks: Generalized and Scaled | May 16, 2023 | MuJoCo, Reinforcement Learning (RL) | Unverified | 0 |
| DisTop: Discovering a Topological representation to learn diverse and rewarding skills | Jun 6, 2021 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified | 0 |
| Closed Loop Interactive Embodied Reasoning for Robot Manipulation | Apr 23, 2024 | MuJoCo, Robot Manipulation | Unverified | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RL, MuJoCo | Unverified | 0 |
| Distributional Decision Transformer for Hindsight Information Matching | Sep 29, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning | Feb 9, 2023 | continuous-control, Continuous Control | Unverified | 0 |
| CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels | Feb 19, 2021 | MuJoCo | Unverified | 0 |
| Action Redundancy in Reinforcement Learning | Feb 22, 2021 | MuJoCo, reinforcement-learning | Unverified | 0 |
| CIM-PPO: Proximal Policy Optimization with Liu-Correntropy Induced Metric | Oct 20, 2021 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| A Pontryagin Perspective on Reinforcement Learning | May 28, 2024 | MuJoCo, reinforcement-learning | Unverified | 0 |
| DIDA: Denoised Imitation Learning based on Domain Adaptation | Apr 4, 2024 | Domain Adaptation, Imitation Learning | Unverified | 0 |
| Distributionally Robust Statistical Verification with Imprecise Neural Networks | Aug 28, 2023 | Active Learning, MuJoCo | Unverified | 0 |
| C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory | Feb 26, 2024 | Imitation Learning, MuJoCo | Unverified | 0 |
| Modular Recurrence in Contextual MDPs for Universal Morphology Control | Jun 10, 2025 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| CEIL: Generalized Contextual Imitation Learning | Jun 26, 2023 | D4RL, Imitation Learning | Unverified | 0 |
| CAT-SAC: Soft Actor-Critic with Curiosity-Aware Entropy Temperature | Jan 1, 2021 | MuJoCo, Reinforcement Learning (RL) | Unverified | 0 |
| Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study | Jul 8, 2025 | MuJoCo, Recommendation Systems | Unverified | 0 |
| CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture | Sep 28, 2023 | Imitation Learning, MuJoCo | Unverified | 0 |