| Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches | Feb 13, 2025 | D4RL, Offline RL | Unverified | 0 |
| Habitizing Diffusion Planning for Efficient and Effective Decision Making | Feb 10, 2025 | CPU, D4RL | Code Available | 1 |
| Skill Expansion and Composition in Parameter Space | Feb 9, 2025 | D4RL | Code Available | 2 |
| Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning | Feb 7, 2025 | Continuous Control | Unverified | 0 |
| Flow Q-Learning | Feb 4, 2025 | Action Generation, D4RL | Code Available | 3 |
| Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network | Feb 1, 2025 | Continuous Control | Unverified | 0 |
| Projection Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning | Jan 15, 2025 | D4RL, Q-Learning | Unverified | 0 |
| DRDT3: Diffusion-Refined Decision Test-Time Training Model | Jan 12, 2025 | D4RL, Offline RL | Unverified | 0 |
| SALE-Based Offline Reinforcement Learning with Ensemble Q-Networks | Jan 7, 2025 | D4RL, Diversity | Unverified | 0 |
| SR-Reward: Taking The Path More Traveled | Jan 4, 2025 | D4RL, Imitation Learning | Unverified | 0 |