| DeepMind Control Suite | Jan 2, 2018 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Jan 9, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Fast Adaptation via Policy-Dynamics Value Functions | Jul 6, 2020 | MuJoCo | CodeCode Available | 1 | 5 |
| Imitation Learning with Sinkhorn Distances | Aug 20, 2020 | Imitation LearningMuJoCo | CodeCode Available | 1 | 5 |
| Learnings Options End-to-End for Continuous Action Tasks | Nov 30, 2017 | MuJoCo | CodeCode Available | 1 | 5 |
| Improving Sample Efficiency in Model-Free Reinforcement Learning from Images | Oct 2, 2019 | Image ReconstructionMuJoCo | CodeCode Available | 1 | 5 |
| SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning | Dec 31, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning | Jun 7, 2024 | Contrastive LearningMeta Reinforcement Learning | CodeCode Available | 1 | 5 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage | May 21, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Partial advantage estimator for proximal policy optimization | Jan 26, 2023 | MuJoCoPolicy Gradient Methods | CodeCode Available | 1 | 5 |