| Policy Representation via Diffusion Probability Model for Reinforcement Learning | May 22, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling | Sep 20, 2023 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Reinforcement Learning with Random Delays | Oct 6, 2020 | Anatomycontinuous-control | CodeCode Available | 1 |
| ARLO: A Framework for Automated Reinforcement Learning | May 20, 2022 | feature selectionMuJoCo | CodeCode Available | 1 |
| Revisiting Design Choices in Proximal Policy Optimization | Sep 23, 2020 | MuJoCo | CodeCode Available | 1 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet | May 12, 2021 | MuJoCoMulti-Goal Reinforcement Learning | CodeCode Available | 1 |
| Robust Deep Reinforcement Learning through Adversarial Loss | Aug 5, 2020 | Adversarial AttackAtari Games | CodeCode Available | 1 |
| Converting Biomechanical Models from OpenSim to MuJoCo | Jun 17, 2020 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| DART: Noise Injection for Robust Imitation Learning | Mar 27, 2017 | Imitation LearningMuJoCo | CodeCode Available | 1 |
| An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation | Mar 13, 2025 | MuJoCo | CodeCode Available | 1 |
| Self-Supervised Exploration via Disagreement | Jun 10, 2019 | Active LearningEfficient Exploration | CodeCode Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Deconstructing the Inductive Biases of Hamiltonian Neural Networks | Feb 10, 2022 | MuJoCo | CodeCode Available | 1 |
| Contrastive Variational Reinforcement Learning for Complex Observations | Aug 6, 2020 | Atari GamesContinuous Control | CodeCode Available | 1 |
| SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning | Dec 31, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Jan 9, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| DeepMind Control Suite | Jan 2, 2018 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk | Jun 18, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | Dec 1, 2021 | Domain AdaptationMuJoCo | CodeCode Available | 1 |
| UCB-driven Utility Function Search for Multi-objective Reinforcement Learning | May 1, 2024 | Decision MakingMuJoCo | CodeCode Available | 1 |
| Nengo and low-power AI hardware for robust, embedded neurorobotics | Jul 20, 2020 | MuJoCo | CodeCode Available | 1 |