| Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning | Dec 1, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Meta-Learning of Structured Task Distributions in Humans and Machines | Oct 5, 2020 | Meta-LearningMeta Reinforcement Learning | CodeCode Available | 0 | 5 |
| Meta-Reinforcement Learning by Tracking Task Non-stationarity | May 18, 2021 | Meta Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Jun 24, 2025 | Meta Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system | Oct 28, 2024 | Code GenerationHumanEval | CodeCode Available | 0 | 5 |
| Context Meta-Reinforcement Learning via Neuromodulation | Oct 30, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Exchangeable Models in Meta Reinforcement Learning | Jun 12, 2020 | Meta Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Constrained Meta-Reinforcement Learning for Adaptable Safety Guarantee with Differentiable Convex Programming | Dec 15, 2023 | Autonomous DrivingMeta-Learning | CodeCode Available | 0 | 5 |
| learn2learn: A Library for Meta-Learning Research | Aug 27, 2020 | Few-Shot LearningMeta-Learning | CodeCode Available | 0 | 5 |
| Learning to reinforcement learn | Nov 17, 2016 | Deep Reinforcement LearningMeta-Learning | CodeCode Available | 0 | 5 |