| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| Offline Model-based Adaptable Policy Learning | Dec 1, 2021 | Decision Making, model | Code Available | 1 | 5 |
| Reset-Free Lifelong Learning with Skill-Space Planning | Dec 7, 2020 | Lifelong learning, MuJoCo | Code Available | 1 | 5 |
| State-only Imitation with Transition Dynamics Mismatch | Feb 27, 2020 | Imitation Learning, MuJoCo | Code Available | 1 | 5 |
| Doubly Mild Generalization for Offline Reinforcement Learning | Nov 12, 2024 | MuJoCo, Offline RL | Code Available | 1 | 5 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| Collaborative Evolutionary Reinforcement Learning | May 2, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| A Quadratic Actor Network for Model-Free Reinforcement Learning | Mar 11, 2021 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning | Sep 17, 2019 | MuJoCo, OpenAI Gym | Code Available | 0 | 5 |
| Client Selection for Federated Policy Optimization with Environment Heterogeneity | May 18, 2023 | MuJoCo, Policy Gradient Methods | Code Available | 0 | 5 |
| Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Mar 14, 2023 | Decision Making, MuJoCo | Code Available | 0 | 5 |
| Application of linear regression method to the deep reinforcement learning in continuous action cases | Mar 19, 2025 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| ADDQ: Adaptive Distributional Double Q-Learning | Jun 24, 2025 | Distributional Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| CGAR: Critic Guided Action Redistribution in Reinforcement Leaning | Jun 23, 2022 | MuJoCo, Reinforcement Learning (RL) | Code Available | 0 | 5 |
| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Jun 6, 2023 | D4RL, MuJoCo | Code Available | 0 | 5 |
| AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents | Jun 19, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning | Dec 14, 2021 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards | Dec 26, 2020 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning | Jun 10, 2025 | Data Augmentation, model | Code Available | 0 | 5 |
| Online Reinforcement Learning in Non-Stationary Context-Driven Environments | Feb 4, 2023 | MuJoCo, reinforcement-learning | Code Available | 0 | 5 |
| Lyapunov-based Safe Policy Optimization for Continuous Control | Jan 28, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Jul 25, 2022 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Adaptive trajectory-constrained exploration strategy for deep reinforcement learning | Dec 27, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| A novel DDPG method with prioritized experience replay | Oct 1, 2017 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| LLMs for sensory-motor control: Combining in-context and iterative learning | Jun 5, 2025 | MuJoCo | Code Available | 0 | 5 |