| When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning | Feb 15, 2023 | Autonomous Drivingcontinuous-control | CodeCode Available | 1 |
| Conservative State Value Estimation for Offline Reinforcement Learning | Feb 14, 2023 | D4RLreinforcement-learning | CodeCode Available | 0 |
| Skill Decision Transformer | Jan 31, 2023 | D4RLDescriptive | CodeCode Available | 0 |
| Anti-Exploration by Random Network Distillation | Jan 31, 2023 | D4RL | CodeCode Available | 1 |
| Improving Behavioural Cloning with Positive Unlabeled Learning | Jan 27, 2023 | Behavioural cloningD4RL | —Unverified | 0 |
| Model-based Offline Reinforcement Learning with Local Misspecification | Jan 26, 2023 | D4RLmodel | —Unverified | 0 |
| Extreme Q-Learning: MaxEnt RL without Entropy | Jan 5, 2023 | D4RLDeep Reinforcement Learning | CodeCode Available | 1 |
| Model-based trajectory stitching for improved behavioural cloning and its applications | Dec 8, 2022 | Behavioural cloningBenchmarking | —Unverified | 0 |
| TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets | Dec 5, 2022 | D4RLMuJoCo | CodeCode Available | 0 |
| Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery | Dec 2, 2022 | D4RLreinforcement-learning | —Unverified | 0 |
| Offline Reinforcement Learning with Closed-Form Policy Improvement Operators | Nov 29, 2022 | D4RLForm | —Unverified | 0 |
| Offline Reinforcement Learning with Adaptive Behavior Regularization | Nov 15, 2022 | D4RLOffline RL | —Unverified | 0 |
| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RLMeta Reinforcement Learning | —Unverified | 0 |
| Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning | Oct 25, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation | Oct 19, 2022 | D4RLMuJoCo | CodeCode Available | 0 |
| Boosting Offline Reinforcement Learning via Data Rebalancing | Oct 17, 2022 | D4RLOffline RL | —Unverified | 0 |
| A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Oct 15, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Mutual Information Regularized Offline Reinforcement Learning | Oct 14, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| CORL: Research-oriented Deep Offline Reinforcement Learning Library | Oct 13, 2022 | BenchmarkingD4RL | CodeCode Available | 3 |
| Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Oct 13, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories | Oct 12, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| State Advantage Weighting for Offline RL | Oct 9, 2022 | D4RLOffline RL | —Unverified | 0 |
| Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization | Oct 7, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling | Sep 29, 2022 | Computational EfficiencyD4RL | CodeCode Available | 1 |
| DCE: Offline Reinforcement Learning With Double Conservative Estimates | Sep 27, 2022 | Computational EfficiencyD4RL | —Unverified | 0 |
| Hierarchical Decision Transformer | Sep 21, 2022 | D4RLreinforcement-learning | —Unverified | 0 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning | Aug 12, 2022 | D4RLOffline RL | CodeCode Available | 2 |
| Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning | Jul 21, 2022 | Autonomous DrivingD4RL | —Unverified | 0 |
| Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination | Jun 16, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning | Jun 9, 2022 | D4RLModel-based Reinforcement Learning | CodeCode Available | 1 |
| Mildly Conservative Q-Learning for Offline Reinforcement Learning | Jun 9, 2022 | D4RLQ-Learning | CodeCode Available | 1 |
| On the Role of Discount Factor in Offline Reinforcement Learning | Jun 7, 2022 | D4RLOffline RL | —Unverified | 0 |
| When does return-conditioned supervised learning work for offline reinforcement learning? | Jun 2, 2022 | D4RLreinforcement-learning | CodeCode Available | 1 |
| Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL | Jun 1, 2022 | D4RLOffline RL | —Unverified | 0 |
| Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters | May 27, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning | May 23, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning | Feb 23, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| A Behavior Regularized Implicit Policy for Offline Reinforcement Learning | Feb 19, 2022 | D4RLreinforcement-learning | —Unverified | 0 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 |
| Flowformer: Linearizing Transformers with Conservation Flows | Feb 13, 2022 | D4RLOffline RL | CodeCode Available | 2 |
| Online Decision Transformer | Feb 11, 2022 | D4RLEfficient Exploration | CodeCode Available | 2 |
| Adversarially Trained Actor Critic for Offline Reinforcement Learning | Feb 5, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| MOORe: Model-based Offline-to-Online Reinforcement Learning | Jan 25, 2022 | D4RLmodel | —Unverified | 0 |
| DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | Dec 9, 2021 | Atari GamesD4RL | —Unverified | 0 |
| Quantile Filtered Imitation Learning | Dec 2, 2021 | D4RLImitation Learning | —Unverified | 0 |
| d3rlpy: An Offline Deep Reinforcement Learning Library | Nov 6, 2021 | D4RLDeep Reinforcement Learning | CodeCode Available | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Nov 2, 2021 | D4RLData Augmentation | —Unverified | 0 |
| False Correlation Reduction for Offline Reinforcement Learning | Oct 24, 2021 | D4RLDecision Making | CodeCode Available | 1 |
| Offline Reinforcement Learning with Value-based Episodic Memory | Oct 19, 2021 | D4RLOffline RL | CodeCode Available | 1 |