| GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning | May 27, 2024 | Data AugmentationDecision Making | CodeCode Available | 1 |
| Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search | May 24, 2024 | Code GenerationLanguage Modelling | CodeCode Available | 1 |
| Reinformer: Max-Return Sequence Modeling for Offline RL | May 14, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| LTLDoG: Satisfying Temporally-Extended Symbolic Constraints for Safe Diffusion-based Planning | May 7, 2024 | Offline RLRobot Manipulation | CodeCode Available | 1 |
| Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning | Apr 30, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL | Feb 11, 2024 | Offline RL | CodeCode Available | 1 |
| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| SEABO: A Simple Search-Based Method for Offline Imitation Learning | Feb 6, 2024 | D4RLImitation Learning | CodeCode Available | 1 |
| Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning | Feb 4, 2024 | Meta Reinforcement LearningOffline RL | CodeCode Available | 1 |
| ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update | Feb 1, 2024 | Imitation LearningOffline RL | CodeCode Available | 1 |
| Online Symbolic Music Alignment with Offline Reinforcement Learning | Dec 31, 2023 | Dynamic Time WarpingOffline RL | CodeCode Available | 1 |
| PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning | Dec 26, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Dec 21, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach | Dec 12, 2023 | Knowledge DistillationOffline RL | CodeCode Available | 1 |
| The Generalization Gap in Offline Reinforcement Learning | Dec 10, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation | Nov 30, 2023 | Offline RLOff-policy evaluation | CodeCode Available | 1 |
| Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning | Oct 31, 2023 | Few-Shot LearningOffline RL | CodeCode Available | 1 |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision MakingOffline RL | CodeCode Available | 1 |
| CROP: Conservative Reward for Model-based Offline Policy Optimization | Oct 26, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Towards Robust Offline Reinforcement Learning under Diverse Data Corruption | Oct 19, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias | Oct 12, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL | Oct 6, 2023 | AttributeOffline RL | CodeCode Available | 1 |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Oct 6, 2023 | D4RLDecision Making | CodeCode Available | 1 |
| Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning | Sep 29, 2023 | Image GenerationOffline RL | CodeCode Available | 1 |
| Zero-Shot Reinforcement Learning from Low Quality Data | Sep 26, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |