| A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective | Mar 12, 2024 | D4RLreinforcement-learning | CodeCode Available | 0 | 5 |
| Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning | Dec 4, 2024 | D4RLImitation Learning | CodeCode Available | 0 | 5 |
| Skill Decision Transformer | Jan 31, 2023 | D4RLDescriptive | CodeCode Available | 0 | 5 |
| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Jun 6, 2023 | D4RLMuJoCo | CodeCode Available | 0 | 5 |
| Directly Forecasting Belief for Reinforcement Learning with Delays | May 1, 2025 | D4RLMuJoCo | CodeCode Available | 0 | 5 |
| Solving Offline Reinforcement Learning with Decision Tree Regression | Jan 21, 2024 | D4RLFeature Importance | CodeCode Available | 0 | 5 |
| State-Constrained Offline Reinforcement Learning | May 23, 2024 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning | Aug 28, 2023 | D4RLOff-policy evaluation | —Unverified | 0 | 0 |
| STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation | May 27, 2025 | D4RLDenoising | —Unverified | 0 | 0 |
| SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning | Aug 23, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | Jun 30, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach | May 8, 2025 | D4RLDecision Making | —Unverified | 0 | 0 |
| Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training | May 22, 2024 | AI AgentAutonomous Driving | —Unverified | 0 | 0 |
| Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning | May 19, 2025 | D4RLModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | May 18, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning | Jun 5, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Uncertainty Regularized Policy Learning for Offline Reinforcement Learning | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 | 0 |
| VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning | Apr 16, 2025 | D4RLOffline RL | —Unverified | 0 | 0 |
| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters | May 27, 2022 | D4RLOffline RL | —Unverified | 0 | 0 |
| You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL | Oct 5, 2021 | D4RLOffline RL | —Unverified | 0 | 0 |
| SelfBC: Self Behavior Cloning for Offline Reinforcement Learning | Aug 4, 2024 | AttributeD4RL | —Unverified | 0 | 0 |
| Accelerating Residual Reinforcement Learning with Uncertainty Estimation | Jun 21, 2025 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning | Dec 22, 2024 | D4RLQ-Learning | —Unverified | 0 | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 | 0 |