| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Jun 2, 2021 | Atari GamesD4RL | CodeCode Available | 1 | 5 |
| Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes | Oct 12, 2021 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning | May 27, 2024 | Data AugmentationDecision Making | CodeCode Available | 1 | 5 |
| Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning | Feb 23, 2022 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| CROP: Conservative Reward for Model-based Offline Policy Optimization | Oct 26, 2023 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Oct 6, 2023 | D4RLDecision Making | CodeCode Available | 1 | 5 |
| Guiding Online Reinforcement Learning with Action-Free Offline Pretraining | Jan 30, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 | 5 |
| Critic Regularized Regression | Jun 26, 2020 | Offline RLregression | CodeCode Available | 1 | 5 |
| Dual RL: Unification and New Methods for Reinforcement and Imitation Learning | Feb 16, 2023 | Imitation LearningOffline RL | CodeCode Available | 1 | 5 |
| Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Jun 5, 2020 | Offline RLreinforcement-learning | CodeCode Available | 1 | 5 |
| Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models | May 24, 2023 | Language ModellingOffline RL | CodeCode Available | 1 | 5 |
| Latent-Variable Advantage-Weighted Policy Optimization for Offline RL | Mar 16, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| MOReL : Model-Based Offline Reinforcement Learning | May 12, 2020 | modelOffline RL | CodeCode Available | 1 | 5 |
| Behavior Proximal Policy Optimization | Feb 22, 2023 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Dec 21, 2023 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning | Sep 22, 2023 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation | Jun 21, 2021 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 | 5 |
| Reinformer: Max-Return Sequence Modeling for Offline RL | May 14, 2024 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| When should we prefer Decision Transformers for Offline Reinforcement Learning? | May 23, 2023 | D4RLImitation Learning | CodeCode Available | 1 | 5 |
| Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows | Nov 20, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 | 5 |
| Efficient Diffusion Policies for Offline Reinforcement Learning | May 31, 2023 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare | May 2, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 | 5 |
| COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation | Apr 19, 2022 | Offline RLOff-policy evaluation | CodeCode Available | 1 | 5 |
| Are Expressive Models Truly Necessary for Offline RL? | Dec 15, 2024 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts | May 15, 2025 | Continual LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| Diffusion Policies creating a Trust Region for Offline Reinforcement Learning | May 30, 2024 | D4RLDenoising | CodeCode Available | 1 | 5 |
| Masked Trajectory Models for Prediction, Representation, and Control | May 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Masked Autoencoding for Scalable and Generalizable Decision Making | Nov 23, 2022 | Decision MakingOffline RL | CodeCode Available | 1 | 5 |
| Optimal Transport for Offline Imitation Learning | Mar 24, 2023 | D4RLDecision Making | CodeCode Available | 1 | 5 |
| Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning | Apr 30, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets | Oct 7, 2022 | Autonomous DrivingBackdoor Attack | CodeCode Available | 1 | 5 |
| Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations | Jul 20, 2022 | Imitation LearningOffline RL | CodeCode Available | 1 | 5 |
| When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning | May 23, 2022 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Mar 9, 2023 | Offline RLQ-Learning | CodeCode Available | 1 | 5 |
| Q-value Regularized Transformer for Offline Reinforcement Learning | May 27, 2024 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings | Jul 23, 2021 | Computational EfficiencyDecision Making | CodeCode Available | 1 | 5 |
| DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors | Sep 26, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning | Jul 1, 2023 | D4RLmodel | CodeCode Available | 1 | 5 |
| An Optimistic Perspective on Offline Reinforcement Learning | Jul 10, 2019 | Atari GamesDiversity | CodeCode Available | 1 | 5 |
| NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning | Feb 1, 2021 | Offline RLreinforcement-learning | CodeCode Available | 1 | 5 |
| Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning | Jan 31, 2022 | DiversityOffline RL | CodeCode Available | 1 | 5 |
| Adversarially Trained Actor Critic for Offline Reinforcement Learning | Feb 5, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Supported Policy Optimization for Offline Reinforcement Learning | Feb 13, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 | 5 |
| Doubly Mild Generalization for Offline Reinforcement Learning | Nov 12, 2024 | MuJoCoOffline RL | CodeCode Available | 1 | 5 |
| Neural Laplace Control for Continuous-time Delayed Systems | Feb 24, 2023 | Model Predictive ControlOffline RL | CodeCode Available | 1 | 5 |
| ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update | Feb 1, 2024 | Imitation LearningOffline RL | CodeCode Available | 1 | 5 |
| COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Mar 16, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 | 5 |
| Off-policy Evaluation in Doubly Inhomogeneous Environments | Jun 14, 2023 | Offline RLOff-policy evaluation | CodeCode Available | 0 | 5 |