| Model Tensor Planning | May 2, 2025 | model, Model Predictive Control | Code Available | 1 | 5 |
| Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning | Jul 6, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk | Jun 18, 2021 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage | Jun 6, 2021 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Learning Invariant Representations for Reinforcement Learning without Reconstruction | Jun 18, 2020 | Causal Inference, MuJoCo | Code Available | 1 | 5 |
| UCB-driven Utility Function Search for Multi-objective Reinforcement Learning | May 1, 2024 | Decision Making, MuJoCo | Code Available | 1 | 5 |
| An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet | May 12, 2021 | MuJoCo, Multi-Goal Reinforcement Learning | Code Available | 1 | 5 |
| Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning | Feb 13, 2023 | Imitation Learning, MuJoCo | Code Available | 1 | 5 |
| Lipschitz-constrained Unsupervised Skill Discovery | Feb 2, 2022 | Diversity, MuJoCo | Code Available | 1 | 5 |
| Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | May 22, 2024 | Ingenuity, MuJoCo | Code Available | 1 | 5 |
| An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation | Mar 13, 2025 | MuJoCo | Code Available | 1 | 5 |
| Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL | Jun 9, 2021 | MuJoCo, Reinforcement Learning (RL) | Code Available | 1 | 5 |
| DART: Noise Injection for Robust Imitation Learning | Mar 27, 2017 | Imitation Learning, MuJoCo | Code Available | 1 | 5 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage | May 21, 2021 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone Connectivity | Oct 31, 2024 | MuJoCo, Q-Learning | Code Available | 1 | 5 |
| Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification | Nov 7, 2022 | MuJoCo | Code Available | 1 | 5 |
| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Jan 9, 2020 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Cross-Modal Domain Adaptation for Reinforcement Learning | Jan 1, 2021 | Domain Adaptation, MuJoCo | Code Available | 1 | 5 |
| Model-free Policy Learning with Reward Gradients | Mar 9, 2021 | Continuous Control, model | Code Available | 1 | 5 |
| LLM-Empowered State Representation for Reinforcement Learning | Jul 18, 2024 | MuJoCo, reinforcement-learning | Code Available | 1 | 5 |
| Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control | Oct 15, 2020 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments | Jan 20, 2021 | MuJoCo | Code Available | 1 | 5 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| DeepMind Control Suite | Jan 2, 2018 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards | May 27, 2019 | Imitation Learning, MuJoCo | Code Available | 1 | 5 |
| Contrastive Variational Reinforcement Learning for Complex Observations | Aug 6, 2020 | Atari Games, Continuous Control | Code Available | 1 | 5 |
| Converting Biomechanical Models from OpenSim to MuJoCo | Jun 17, 2020 | MuJoCo, reinforcement-learning | Code Available | 1 | 5 |
| Imitation Learning with Sinkhorn Distances | Aug 20, 2020 | Imitation Learning, MuJoCo | Code Available | 1 | 5 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| Collaborative Evolutionary Reinforcement Learning | May 2, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| A Quadratic Actor Network for Model-Free Reinforcement Learning | Mar 11, 2021 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Lyapunov-based Safe Policy Optimization for Continuous Control | Jan 28, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| A Pragmatic Look at Deep Imitation Learning | Aug 4, 2021 | Behavioural cloning, D4RL | Code Available | 0 | 5 |
| Client Selection for Federated Policy Optimization with Environment Heterogeneity | May 18, 2023 | MuJoCo, Policy Gradient Methods | Code Available | 0 | 5 |
| Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards | Dec 26, 2020 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Application of linear regression method to the deep reinforcement learning in continuous action cases | Mar 19, 2025 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| LLMs for sensory-motor control: Combining in-context and iterative learning | Jun 5, 2025 | MuJoCo | Code Available | 0 | 5 |
| ADDQ: Adaptive Distributional Double Q-Learning | Jun 24, 2025 | Distributional Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| CGAR: Critic Guided Action Redistribution in Reinforcement Leaning | Jun 23, 2022 | MuJoCo, Reinforcement Learning (RL) | Code Available | 0 | 5 |
| Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Jul 25, 2022 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Online Reinforcement Learning in Non-Stationary Context-Driven Environments | Feb 4, 2023 | MuJoCo, reinforcement-learning | Code Available | 0 | 5 |
| A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | Dec 12, 2023 | MuJoCo, Offline RL | Code Available | 0 | 5 |
| CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning | Dec 14, 2021 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Leveraging exploration in off-policy algorithms via normalizing flows | May 16, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents | Jun 19, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios | Oct 12, 2023 | Board Games, Decision Making | Code Available | 0 | 5 |
| Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Jun 24, 2025 | Meta Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| Learning to Play Cup-and-Ball with Noisy Camera Observations | Jul 19, 2020 | MuJoCo | Code Available | 0 | 5 |