| MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBench | Aug 1, 2024 | Humanoid ControlMuJoCo | CodeCode Available | 5 |
| Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation | May 31, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 5 |
| Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation | May 2, 2024 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 5 |
| Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer | Apr 8, 2024 | MuJoCoPhysical Simulations | CodeCode Available | 5 |
| EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine | Jun 21, 2022 | MuJoCoreinforcement-learning | CodeCode Available | 5 |
| Streaming Deep Reinforcement Learning Finally Works | Oct 18, 2024 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 3 |
| XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library | Dec 25, 2023 | CPUDeep Reinforcement Learning | CodeCode Available | 3 |
| Learning Bipedal Walking On Planned Footsteps For Humanoid Robots | Jul 26, 2022 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 3 |
| Tianshou: a Highly Modularized Deep Reinforcement Learning Library | Jul 29, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 3 |
| Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement | Oct 15, 2024 | DisentanglementInductive Bias | CodeCode Available | 2 |
| Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning | May 27, 2024 | Gym halfcheetah-mediumGym halfcheetah-medium-expert | CodeCode Available | 2 |
| Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization | May 25, 2024 | continuous-controlContinuous Control | CodeCode Available | 2 |
| Diffusion Actor-Critic with Entropy Regulator | May 24, 2024 | Decision MakingMuJoCo | CodeCode Available | 2 |
| Simple Policy Optimization | Jan 29, 2024 | MuJoCo | CodeCode Available | 2 |
| Text2Reward: Reward Shaping with Language Models for Reinforcement Learning | Sep 20, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 2 |
| Maximum Entropy Heterogeneous-Agent Reinforcement Learning | Jun 19, 2023 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Multi-Agent Reinforcement Learning is a Sequence Modeling Problem | May 30, 2022 | Decision MakingMuJoCo | CodeCode Available | 2 |
| JORLDY: a fully customizable open source framework for reinforcement learning | Apr 11, 2022 | MuJoCoOpenAI Gym | CodeCode Available | 2 |
| Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation | Jun 24, 2021 | MuJoCoOpenAI Gym | CodeCode Available | 2 |
| robosuite: A Modular Simulation Framework and Benchmark for Robot Learning | Sep 25, 2020 | Gesture GenerationMuJoCo | CodeCode Available | 2 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Reinforcement Learning for Ballbot Navigation in Uneven Terrain | May 23, 2025 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| Model Tensor Planning | May 2, 2025 | modelModel Predictive Control | CodeCode Available | 1 |
| An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation | Mar 13, 2025 | MuJoCo | CodeCode Available | 1 |
| Maximum Entropy Reinforcement Learning with Diffusion Policy | Feb 17, 2025 | Efficient ExplorationMuJoCo | CodeCode Available | 1 |