| CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization | Jun 18, 2025 | D4RLOffline RL | CodeCode Available | 0 |
| MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | Jun 11, 2025 | D4RLDeep Reinforcement Learning | —Unverified | 0 |
| Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood | Jun 10, 2025 | Computational EfficiencyD4RL | CodeCode Available | 0 |
| Policy-Based Trajectory Clustering in Offline Reinforcement Learning | Jun 10, 2025 | ClusteringD4RL | —Unverified | 0 |
| STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation | May 27, 2025 | D4RLDenoising | —Unverified | 0 |
| Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL | May 26, 2025 | D4RLOffline RL | CodeCode Available | 0 |
| Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning | May 19, 2025 | D4RLModel-based Reinforcement Learning | —Unverified | 0 |
| Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning | May 19, 2025 | D4RLmodel | —Unverified | 0 |
| Imagination-Limited Q-Learning for Offline Reinforcement Learning | May 18, 2025 | D4RLQ-Learning | —Unverified | 0 |
| Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer | May 14, 2025 | counterfactualCounterfactual Reasoning | CodeCode Available | 0 |