| Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations | Jun 9, 2022 | Benchmarkingcontinuous-control | CodeCode Available | 2 |
| D4RL: Datasets for Deep Data-Driven Reinforcement Learning | Apr 15, 2020 | D4RLOffline RL | CodeCode Available | 2 |
| Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Feb 21, 2024 | Decision MakingImitation Learning | CodeCode Available | 2 |
| Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization | May 25, 2024 | continuous-controlContinuous Control | CodeCode Available | 2 |
| Flowformer: Linearizing Transformers with Conservation Flows | Feb 13, 2022 | D4RLOffline RL | CodeCode Available | 2 |
| Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning | Aug 12, 2022 | D4RLOffline RL | CodeCode Available | 2 |
| AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Aug 7, 2023 | Offline RLreinforcement-learning | CodeCode Available | 2 |
| What Makes a Good Diffusion Planner for Decision Making? | Mar 1, 2025 | Action GenerationDecision Making | CodeCode Available | 2 |
| All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RL | Feb 24, 2022 | AllImitation Learning | CodeCode Available | 1 |
| Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation | Jul 10, 2023 | Decision MakingInteractive Recommendation | CodeCode Available | 1 |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Jun 2, 2021 | Atari GamesD4RL | CodeCode Available | 1 |
| DataLight: Offline Data-Driven Traffic Signal Control | Mar 20, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| AdaCat: Adaptive Categorical Discretization for Autoregressive Models | Aug 3, 2022 | Density EstimationOffline RL | CodeCode Available | 1 |
| Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information | Oct 31, 2022 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Critic Regularized Regression | Jun 26, 2020 | Offline RLregression | CodeCode Available | 1 |
| CROP: Conservative Reward for Model-based Offline Policy Optimization | Oct 26, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning | Sep 22, 2023 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation | Apr 19, 2022 | Offline RLOff-policy evaluation | CodeCode Available | 1 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Dec 21, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Curriculum Offline Imitation Learning | Nov 3, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Jun 5, 2020 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Adversarially Trained Actor Critic for Offline Reinforcement Learning | Feb 5, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| A Workflow for Offline Model-Free Robotic Reinforcement Learning | Sep 22, 2021 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 |
| Conservative Q-Learning for Offline Reinforcement Learning | Jun 8, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |