| Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables | Mar 19, 2019 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 0 | 5 |
| A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud | May 31, 2022 | CPUDecision Making | CodeCode Available | 0 | 5 |
| Modeling and Optimization Trade-off in Meta-learning | Oct 24, 2020 | Bilevel OptimizationMeta-Learning | CodeCode Available | 0 | 5 |
| Causal Reasoning from Meta-reinforcement Learning | Jan 23, 2019 | counterfactualMeta Reinforcement Learning | CodeCode Available | 0 | 5 |
| Meta Reinforcement Learning with Finite Training Tasks -- a Density Estimation Approach | Jun 21, 2022 | Density EstimationDimensionality Reduction | CodeCode Available | 0 | 5 |
| Meta-Reinforcement Learning via Buffering Graph Signatures for Live Video Streaming Events | Oct 3, 2021 | Meta-LearningMeta Reinforcement Learning | CodeCode Available | 0 | 5 |
| Meta Reinforcement Learning with Task Embedding and Shared Policy | May 16, 2019 | Deep Reinforcement LearningMeta-Learning | CodeCode Available | 0 | 5 |
| Meta-Reinforcement Learning by Tracking Task Non-stationarity | May 18, 2021 | Meta Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Meta-Reinforcement Learning for Reliable Communication in THz/VLC Wireless VR Networks | Jan 29, 2021 | Meta-LearningMeta Reinforcement Learning | CodeCode Available | 0 | 5 |
| Meta reinforcement learning as task inference | May 15, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Meta-Reinforcement Learning in Broad and Non-Parametric Environments | Aug 8, 2021 | Meta Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation | Mar 12, 2024 | Contrastive LearningData Augmentation | CodeCode Available | 0 | 5 |
| Disentangling Abstraction from Statistical Pattern Matching in Human and Machine Learning | Apr 4, 2022 | BIG-bench Machine LearningInductive Bias | CodeCode Available | 0 | 5 |
| Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks | May 24, 2025 | Meta Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Meta-Learning of Structured Task Distributions in Humans and Machines | Oct 5, 2020 | Meta-LearningMeta Reinforcement Learning | CodeCode Available | 0 | 5 |
| Meta Policy Learning for Cold-Start Conversational Recommendation | May 24, 2022 | Conversational RecommendationMeta Reinforcement Learning | CodeCode Available | 0 | 5 |
| Meta-Q-Learning | Sep 30, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills | Dec 11, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning | Sep 23, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Learning to reinforcement learn | Nov 17, 2016 | Deep Reinforcement LearningMeta-Learning | CodeCode Available | 0 | 5 |
| Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning | Dec 1, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Adaptable image quality assessment using meta-reinforcement learning of task amenability | Jul 31, 2021 | image-classificationImage Classification | CodeCode Available | 0 | 5 |
| Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning | Feb 4, 2025 | Meta Reinforcement Learning | CodeCode Available | 0 | 5 |
| Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Jun 24, 2025 | Meta Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| Introducing Neuromodulation in Deep Neural Networks to Learn Adaptive Behaviours | Dec 21, 2018 | Meta Reinforcement LearningReinforcement Learning | CodeCode Available | 0 | 5 |