| Reset-Free Lifelong Learning with Skill-Space Planning | Dec 7, 2020 | Lifelong learning, MuJoCo | Code Available | 1 | 5 |
| Generalized Decision Transformer for Offline Hindsight Information Matching | Nov 19, 2021 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| Robust Deep Reinforcement Learning through Adversarial Loss | Aug 5, 2020 | Adversarial Attack, Atari Games | Code Available | 1 | 5 |
| ARLO: A Framework for Automated Reinforcement Learning | May 20, 2022 | feature selection, MuJoCo | Code Available | 1 | 5 |
| How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization | Apr 29, 2020 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet | May 12, 2021 | MuJoCo, Multi-Goal Reinforcement Learning | Code Available | 1 | 5 |
| Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation | Aug 17, 2017 | Atari Games, continuous-control | Code Available | 1 | 5 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| Conditioning Sparse Variational Gaussian Processes for Online Decision-making | Oct 28, 2021 | Active Learning, Decision Making | Code Available | 1 | 5 |
| An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation | Mar 13, 2025 | MuJoCo | Code Available | 1 | 5 |
| Imitation Learning with Sinkhorn Distances | Aug 20, 2020 | Imitation Learning, MuJoCo | Code Available | 1 | 5 |
| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Jan 9, 2020 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Contrastive Variational Reinforcement Learning for Complex Observations | Aug 6, 2020 | Atari Games, Continuous Control | Code Available | 1 | 5 |
| Joint action loss for proximal policy optimization | Jan 26, 2023 | Dota 2, MuJoCo | Code Available | 1 | 5 |
| Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning | Jun 7, 2024 | Contrastive Learning, Meta Reinforcement Learning | Code Available | 1 | 5 |
| Learning Successor Features the Simple Way | Oct 29, 2024 | Continual Learning, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Maximum Entropy Reinforcement Learning with Diffusion Policy | Feb 17, 2025 | Efficient Exploration, MuJoCo | Code Available | 1 | 5 |
| Model Tensor Planning | May 2, 2025 | model, Model Predictive Control | Code Available | 1 | 5 |
| Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference | Feb 7, 2024 | MuJoCo | Code Available | 1 | 5 |
| Learning Invariant Representations for Reinforcement Learning without Reconstruction | Jun 18, 2020 | Causal Inference, MuJoCo | Code Available | 1 | 5 |
| Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning | Sep 23, 2021 | LEMMA, MuJoCo | Code Available | 1 | 5 |
| Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling | Sep 20, 2023 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 1 | 5 |