| Action Redundancy in Reinforcement Learning | Feb 22, 2021 | MuJoCo, reinforcement-learning | —Unverified | 0 |
| CIM-PPO: Proximal Policy Optimization with Liu-Correntropy Induced Metric | Oct 20, 2021 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 |
| A Pontryagin Perspective on Reinforcement Learning | May 28, 2024 | MuJoCo, reinforcement-learning | —Unverified | 0 |
| Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback | May 13, 2023 | MuJoCo, Reinforcement Learning (RL) | —Unverified | 0 |
| C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory | Feb 26, 2024 | Imitation Learning, MuJoCo | —Unverified | 0 |
| A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | Dec 12, 2023 | MuJoCo, Offline RL | —Unverified | 0 |
| Modular Recurrence in Contextual MDPs for Universal Morphology Control | Jun 10, 2025 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 |
| CEIL: Generalized Contextual Imitation Learning | Jun 26, 2023 | D4RL, Imitation Learning | —Unverified | 0 |
| CAT-SAC: Soft Actor-Critic with Curiosity-Aware Entropy Temperature | Jan 1, 2021 | MuJoCo, Reinforcement Learning (RL) | —Unverified | 0 |
| DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays | Jun 5, 2024 | MuJoCo, Reinforcement Learning (RL) | —Unverified | 0 |