| GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning | May 27, 2024 | Data AugmentationDecision Making | CodeCode Available | 1 |
| Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search | May 24, 2024 | Code GenerationLanguage Modelling | CodeCode Available | 1 |
| Reinformer: Max-Return Sequence Modeling for Offline RL | May 14, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| LTLDoG: Satisfying Temporally-Extended Symbolic Constraints for Safe Diffusion-based Planning | May 7, 2024 | Offline RLRobot Manipulation | CodeCode Available | 1 |
| Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning | Apr 30, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL | Feb 11, 2024 | Offline RL | CodeCode Available | 1 |
| SEABO: A Simple Search-Based Method for Offline Imitation Learning | Feb 6, 2024 | D4RLImitation Learning | CodeCode Available | 1 |
| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning | Feb 4, 2024 | Meta Reinforcement LearningOffline RL | CodeCode Available | 1 |
| ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update | Feb 1, 2024 | Imitation LearningOffline RL | CodeCode Available | 1 |
| Online Symbolic Music Alignment with Offline Reinforcement Learning | Dec 31, 2023 | Dynamic Time WarpingOffline RL | CodeCode Available | 1 |
| PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning | Dec 26, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Dec 21, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach | Dec 12, 2023 | Knowledge DistillationOffline RL | CodeCode Available | 1 |
| The Generalization Gap in Offline Reinforcement Learning | Dec 10, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation | Nov 30, 2023 | Offline RLOff-policy evaluation | CodeCode Available | 1 |
| Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning | Oct 31, 2023 | Few-Shot LearningOffline RL | CodeCode Available | 1 |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision MakingOffline RL | CodeCode Available | 1 |
| CROP: Conservative Reward for Model-based Offline Policy Optimization | Oct 26, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Towards Robust Offline Reinforcement Learning under Diverse Data Corruption | Oct 19, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias | Oct 12, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL | Oct 6, 2023 | AttributeOffline RL | CodeCode Available | 1 |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Oct 6, 2023 | D4RLDecision Making | CodeCode Available | 1 |
| Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning | Sep 29, 2023 | Image GenerationOffline RL | CodeCode Available | 1 |
| Zero-Shot Reinforcement Learning from Low Quality Data | Sep 26, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning | Sep 22, 2023 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning | Sep 14, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Reasoning with Latent Diffusion in Offline Reinforcement Learning | Sep 12, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning | Sep 6, 2023 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| Towards Self-Assembling Artificial Neural Networks through Neural Developmental Programs | Jul 17, 2023 | Offline RL | CodeCode Available | 1 |
| Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning | Jul 13, 2023 | BenchmarkingOffline RL | CodeCode Available | 1 |
| Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation | Jul 10, 2023 | Decision MakingInteractive Recommendation | CodeCode Available | 1 |
| Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning | Jul 1, 2023 | D4RLmodel | CodeCode Available | 1 |
| Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting | Jun 22, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning | Jun 22, 2023 | Data AugmentationOffline RL | CodeCode Available | 1 |
| Policy Regularization with Dataset Constraint for Offline Reinforcement Learning | Jun 11, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Decoupled Prioritized Resampling for Offline RL | Jun 8, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL | Jun 7, 2023 | Data AugmentationOffline RL | CodeCode Available | 1 |
| Improving and Benchmarking Offline Reinforcement Learning Algorithms | Jun 1, 2023 | AttributeBenchmarking | CodeCode Available | 1 |
| Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation | May 31, 2023 | D4RLLanguage Modelling | CodeCode Available | 1 |
| Efficient Diffusion Policies for Offline Reinforcement Learning | May 31, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| MADiff: Offline Multi-agent Learning with Diffusion Models | May 27, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models | May 24, 2023 | Language ModellingOffline RL | CodeCode Available | 1 |
| Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning | May 24, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| When should we prefer Decision Transformers for Offline Reinforcement Learning? | May 23, 2023 | D4RLImitation Learning | CodeCode Available | 1 |
| Revisiting the Minimalist Approach to Offline Reinforcement Learning | May 16, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Federated Ensemble-Directed Offline Reinforcement Learning | May 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Masked Trajectory Models for Prediction, Representation, and Control | May 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare | May 2, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies | Apr 20, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |