| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement LearningOffline RL | CodeCode Available | 1 |
| IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies | Apr 20, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| CIRS: Bursting Filter Bubbles by Counterfactual Interactive Recommender System | Apr 4, 2022 | Causal Inferencecounterfactual | CodeCode Available | 1 |
| Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models | May 24, 2023 | Language ModellingOffline RL | CodeCode Available | 1 |
| Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning | May 24, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| A Minimalist Approach to Offline Reinforcement Learning | Jun 12, 2021 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| COMBO: Conservative Offline Model-Based Policy Optimization | Feb 16, 2021 | modelOffline RL | CodeCode Available | 1 |
| Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information | Oct 31, 2022 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning | Sep 22, 2023 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Behavior Transformers: Cloning k modes with one stone | Jun 22, 2022 | Object DetectionOffline RL | CodeCode Available | 1 |