| MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure | May 1, 2024 | Efficient ExplorationMuJoCo | —Unverified | 0 |
| MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL | May 31, 2023 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning | Apr 29, 2023 | Meta Reinforcement LearningMuJoCo | —Unverified | 0 |
| Meta-Reinforcement Learning via Exploratory Task Clustering | Feb 15, 2023 | ClusteringMeta Reinforcement Learning | —Unverified | 0 |
| Meta Reinforcement Learning with Distribution of Exploration Parameters Learned by Evolution Strategies | Dec 29, 2018 | Meta-LearningMeta Reinforcement Learning | —Unverified | 0 |
| Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Jul 25, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Leveraging exploration in off-policy algorithms via normalizing flows | May 16, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning | Jan 31, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| LLMs for sensory-motor control: Combining in-context and iterative learning | Jun 5, 2025 | MuJoCo | CodeCode Available | 0 |
| Online Reinforcement Learning in Non-Stationary Context-Driven Environments | Feb 4, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 0 |