| Latent-Variable Advantage-Weighted Policy Optimization for Offline RL | Mar 16, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows | Nov 20, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| CIRS: Bursting Filter Bubbles by Counterfactual Interactive Recommender System | Apr 4, 2022 | Causal Inferencecounterfactual | CodeCode Available | 1 |
| Curriculum Offline Imitation Learning | Nov 3, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning | May 24, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| LTLDoG: Satisfying Temporally-Extended Symbolic Constraints for Safe Diffusion-based Planning | May 7, 2024 | Offline RLRobot Manipulation | CodeCode Available | 1 |
| COMBO: Conservative Offline Model-Based Policy Optimization | Feb 16, 2021 | modelOffline RL | CodeCode Available | 1 |
| Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information | Oct 31, 2022 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets | Oct 7, 2022 | Autonomous DrivingBackdoor Attack | CodeCode Available | 1 |
| MoCoDA: Model-based Counterfactual Data Augmentation | Oct 20, 2022 | counterfactualData Augmentation | CodeCode Available | 1 |
| A Workflow for Offline Model-Free Robotic Reinforcement Learning | Sep 22, 2021 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RLDistributional Reinforcement Learning | CodeCode Available | 1 |
| Conservative Q-Learning for Offline Reinforcement Learning | Jun 8, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Zero-Shot Reinforcement Learning from Low Quality Data | Sep 26, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning | Oct 11, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning | Sep 29, 2023 | Image GenerationOffline RL | CodeCode Available | 1 |
| Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning | Dec 25, 2024 | Decision MakingOffline RL | CodeCode Available | 1 |
| NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios | Mar 25, 2025 | BenchmarkingOffline RL | CodeCode Available | 1 |
| Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation | Jul 10, 2023 | Decision MakingInteractive Recommendation | CodeCode Available | 1 |
| Offline Meta-Reinforcement Learning with Advantage Weighting | Aug 13, 2020 | Machine TranslationMeta-Learning | CodeCode Available | 1 |
| A Minimalist Approach to Offline Reinforcement Learning | Jun 12, 2021 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Decoupled Prioritized Resampling for Offline RL | Jun 8, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RL | Feb 24, 2022 | AllImitation Learning | CodeCode Available | 1 |
| Offline Reinforcement Learning for Visual Navigation | Dec 16, 2022 | NavigateOffline RL | CodeCode Available | 1 |
| Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Jun 5, 2020 | Offline RLreinforcement-learning | CodeCode Available | 1 |