| Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning | Jun 27, 2023 | D4RLOffline RL | —Unverified | 0 | 0 |
| Projection Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning | Jan 15, 2025 | D4RLQ-Learning | —Unverified | 0 | 0 |
| Quantile Filtered Imitation Learning | Dec 2, 2021 | D4RLImitation Learning | —Unverified | 0 | 0 |
| Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning | Sep 12, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Reducing Conservativeness Oriented Offline Reinforcement Learning | Feb 27, 2021 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| Regularized Conditional Diffusion Model for Multi-Task Preference Alignment | Apr 7, 2024 | D4RLDecision Making | —Unverified | 0 | 0 |
| Rethinking Optimal Transport in Offline Reinforcement Learning | Oct 17, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning | Oct 30, 2024 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning | Oct 9, 2023 | D4RLModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space | Oct 21, 2024 | ClusteringD4RL | —Unverified | 0 | 0 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | Mar 10, 2021 | Autonomous DrivingD4RL | —Unverified | 0 | 0 |
| SALE-Based Offline Reinforcement Learning with Ensemble Q-Networks | Jan 7, 2025 | D4RLDiversity | —Unverified | 0 | 0 |
| SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance | Oct 24, 2024 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 | 0 |
| SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Simple Ingredients for Offline Reinforcement Learning | Mar 19, 2024 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| SR-Reward: Taking The Path More Traveled | Jan 4, 2025 | D4RLImitation Learning | —Unverified | 0 | 0 |
| State-Action Joint Regularized Implicit Policy for Offline Reinforcement Learning | Sep 29, 2021 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| State Advantage Weighting for Offline RL | Oct 9, 2022 | D4RLOffline RL | —Unverified | 0 | 0 |
| State-Constrained Offline Reinforcement Learning | May 23, 2024 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning | Aug 28, 2023 | D4RLOff-policy evaluation | —Unverified | 0 | 0 |
| STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation | May 27, 2025 | D4RLDenoising | —Unverified | 0 | 0 |
| SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning | Aug 23, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | Jun 30, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach | May 8, 2025 | D4RLDecision Making | —Unverified | 0 | 0 |
| Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training | May 22, 2024 | AI AgentAutonomous Driving | —Unverified | 0 | 0 |