| Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning | Sep 23, 2021 | LEMMA, MuJoCo | Code Available | 1 |
| UCB-driven Utility Function Search for Multi-objective Reinforcement Learning | May 1, 2024 | Decision Making, MuJoCo | Code Available | 1 |
| Cross-Modal Domain Adaptation for Reinforcement Learning | Jan 1, 2021 | Domain Adaptation, MuJoCo | Code Available | 1 |
| Unsupervised Skill Discovery with Bottleneck Option Learning | Jun 27, 2021 | Disentanglement, MuJoCo | Code Available | 1 |
| Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | Dec 1, 2021 | Domain Adaptation, MuJoCo | Code Available | 1 |
| VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning | Oct 18, 2019 | Meta-Learning, MuJoCo | Code Available | 1 |
| Delay-Aware Model-Based Reinforcement Learning for Continuous Control | May 11, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| FM-TS: Flow Matching for Time Series Generation | Nov 12, 2024 | Benchmarking, Imputation | Code Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 1 |
| Offline Model-based Adaptable Policy Learning | Dec 1, 2021 | Decision Making, model | Code Available | 1 |