| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Order Matters: Agent-by-agent Policy Optimization | Feb 13, 2023 | MuJoCo | CodeCode Available | 1 | 5 |
| Partial advantage estimator for proximal policy optimization | Jan 26, 2023 | MuJoCoPolicy Gradient Methods | CodeCode Available | 1 | 5 |
| An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet | May 12, 2021 | MuJoCoMulti-Goal Reinforcement Learning | CodeCode Available | 1 | 5 |
| FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning | Oct 4, 2020 | GPUMuJoCo | CodeCode Available | 1 | 5 |
| An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation | Mar 13, 2025 | MuJoCo | CodeCode Available | 1 | 5 |
| Converting Biomechanical Models from OpenSim to MuJoCo | Jun 17, 2020 | MuJoCoreinforcement-learning | CodeCode Available | 1 | 5 |
| RealAnt: An Open-Source Low-Cost Quadruped for Education and Research in Real-World Reinforcement Learning | Nov 5, 2020 | MuJoCoreinforcement-learning | CodeCode Available | 1 | 5 |
| FM-TS: Flow Matching for Time Series Generation | Nov 12, 2024 | BenchmarkingImputation | CodeCode Available | 1 | 5 |
| Reset-Free Lifelong Learning with Skill-Space Planning | Dec 7, 2020 | Lifelong learningMuJoCo | CodeCode Available | 1 | 5 |
| Revisiting Design Choices in Proximal Policy Optimization | Sep 23, 2020 | MuJoCo | CodeCode Available | 1 | 5 |
| AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners | Feb 3, 2023 | DiversityMuJoCo | CodeCode Available | 1 | 5 |
| Cross-Modal Domain Adaptation for Reinforcement Learning | Jan 1, 2021 | Domain AdaptationMuJoCo | CodeCode Available | 1 | 5 |
| Multi-Agent Trust Region Learning | Jan 1, 2021 | Atari GamesMuJoCo | CodeCode Available | 1 | 5 |
| MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration | Jun 15, 2020 | Efficient ExplorationMeta Reinforcement Learning | CodeCode Available | 1 | 5 |
| Learnings Options End-to-End for Continuous Action Tasks | Nov 30, 2017 | MuJoCo | CodeCode Available | 1 | 5 |
| Lipschitz-constrained Unsupervised Skill Discovery | Feb 2, 2022 | DiversityMuJoCo | CodeCode Available | 1 | 5 |
| Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference | Feb 7, 2024 | MuJoCo | CodeCode Available | 1 | 5 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 | 5 |
| A Bayesian Approach to Robust Inverse Reinforcement Learning | Sep 15, 2023 | Imitation LearningMuJoCo | CodeCode Available | 1 | 5 |
| Conditioning Sparse Variational Gaussian Processes for Online Decision-making | Oct 28, 2021 | Active LearningDecision Making | CodeCode Available | 1 | 5 |
| LLM-Empowered State Representation for Reinforcement Learning | Jul 18, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 1 | 5 |
| Contrastive Variational Reinforcement Learning for Complex Observations | Aug 6, 2020 | Atari GamesContinuous Control | CodeCode Available | 1 | 5 |
| Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning | Oct 11, 2024 | DiversityMuJoCo | CodeCode Available | 1 | 5 |
| Learning Invariant Representations for Reinforcement Learning without Reconstruction | Jun 18, 2020 | Causal InferenceMuJoCo | CodeCode Available | 1 | 5 |
| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RLDistributional Reinforcement Learning | CodeCode Available | 1 | 5 |
| A Game-Theoretic Approach to Multi-Agent Trust Region Optimization | Jun 12, 2021 | Atari GamesMuJoCo | CodeCode Available | 1 | 5 |
| Learning Successor Features the Simple Way | Oct 29, 2024 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture | May 28, 2021 | Meta Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Imitation Learning with Sinkhorn Distances | Aug 20, 2020 | Imitation LearningMuJoCo | CodeCode Available | 1 | 5 |
| Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | Dec 1, 2021 | Domain AdaptationMuJoCo | CodeCode Available | 1 | 5 |
| DeepMind Control Suite | Jan 2, 2018 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| DART: Noise Injection for Robust Imitation Learning | Mar 27, 2017 | Imitation LearningMuJoCo | CodeCode Available | 1 | 5 |
| Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations | Oct 14, 2024 | Dimensionality ReductionMuJoCo | CodeCode Available | 1 | 5 |
| Improving Sample Efficiency in Model-Free Reinforcement Learning from Images | Oct 2, 2019 | Image ReconstructionMuJoCo | CodeCode Available | 1 | 5 |
| Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | May 22, 2024 | IngenuityMuJoCo | CodeCode Available | 1 | 5 |
| How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization | Apr 29, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Delay-Aware Model-Based Reinforcement Learning for Continuous Control | May 11, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage | Jun 6, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Joint action loss for proximal policy optimization | Jan 26, 2023 | Dota 2MuJoCo | CodeCode Available | 1 | 5 |
| ARLO: A Framework for Automated Reinforcement Learning | May 20, 2022 | feature selectionMuJoCo | CodeCode Available | 1 | 5 |
| Doubly Mild Generalization for Offline Reinforcement Learning | Nov 12, 2024 | MuJoCoOffline RL | CodeCode Available | 1 | 5 |
| Model Tensor Planning | May 2, 2025 | modelModel Predictive Control | CodeCode Available | 1 | 5 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Fast Adaptation via Policy-Dynamics Value Functions | Jul 6, 2020 | MuJoCo | CodeCode Available | 1 | 5 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation | Jul 17, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 1 | 5 |
| Evolution Strategies as a Scalable Alternative to Reinforcement Learning | Mar 10, 2017 | Atari GamesMuJoCo | CodeCode Available | 1 | 5 |
| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Jan 9, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |