| EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Jul 21, 2020 | D4RL, Decision Making | —Unverified | 0 | 0 |
| Emergent Agentic Transformer from Chain of Hindsight Experience | May 26, 2023 | D4RL, Imitation Learning | —Unverified | 0 | 0 |
| Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation | Nov 18, 2024 | D4RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Dec 5, 2024 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization | Jan 1, 2021 | D4RL, MuJoCo | —Unverified | 0 | 0 |
| Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery | Dec 2, 2022 | D4RL, reinforcement-learning | —Unverified | 0 | 0 |
| Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Sep 9, 2024 | D4RL, Decision Making | —Unverified | 0 | 0 |
| Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning | May 30, 2024 | D4RL, Decision Making | —Unverified | 0 | 0 |
| From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning | Jul 17, 2025 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Goal-Conditioned Data Augmentation for Offline Reinforcement Learning | Dec 29, 2024 | D4RL, Data Augmentation | —Unverified | 0 | 0 |