| Pareto Policy Pool for Model-based Offline Reinforcement Learning | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 | 0 |
| Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens | Sep 14, 2024 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| Policy-Based Trajectory Clustering in Offline Reinforcement Learning | Jun 10, 2025 | ClusteringD4RL | —Unverified | 0 | 0 |
| Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning | May 19, 2025 | D4RLmodel | —Unverified | 0 | 0 |
| Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning | May 9, 2025 | D4RLOffline RL | —Unverified | 0 | 0 |
| Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning | Jun 27, 2023 | D4RLOffline RL | —Unverified | 0 | 0 |
| Projection Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning | Jan 15, 2025 | D4RLQ-Learning | —Unverified | 0 | 0 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RLOffline RL | —Unverified | 0 | 0 |
| Quantile Filtered Imitation Learning | Dec 2, 2021 | D4RLImitation Learning | —Unverified | 0 | 0 |
| Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning | Sep 12, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Reducing Conservativeness Oriented Offline Reinforcement Learning | Feb 27, 2021 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| Regularized Conditional Diffusion Model for Multi-Task Preference Alignment | Apr 7, 2024 | D4RLDecision Making | —Unverified | 0 | 0 |
| Rethinking Optimal Transport in Offline Reinforcement Learning | Oct 17, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning | Oct 30, 2024 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning | Oct 9, 2023 | D4RLModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space | Oct 21, 2024 | ClusteringD4RL | —Unverified | 0 | 0 |
| Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation | Oct 19, 2022 | D4RLMuJoCo | —Unverified | 0 | 0 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | Mar 10, 2021 | Autonomous DrivingD4RL | —Unverified | 0 | 0 |
| SALE-Based Offline Reinforcement Learning with Ensemble Q-Networks | Jan 7, 2025 | D4RLDiversity | —Unverified | 0 | 0 |
| SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance | Oct 24, 2024 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 | 0 |
| SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Simple Ingredients for Offline Reinforcement Learning | Mar 19, 2024 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| SR-Reward: Taking The Path More Traveled | Jan 4, 2025 | D4RLImitation Learning | —Unverified | 0 | 0 |
| State-Action Joint Regularized Implicit Policy for Offline Reinforcement Learning | Sep 29, 2021 | D4RLreinforcement-learning | —Unverified | 0 | 0 |