| Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach | May 8, 2025 | D4RL, Decision Making | Unverified | 0 |
| Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study | May 4, 2025 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning | May 3, 2025 | D4RL, Offline RL | Unverified | 0 |
| Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator | Apr 23, 2025 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning | Apr 16, 2025 | D4RL, Offline RL | Unverified | 0 |
| A Clean Slate for Offline Reinforcement Learning | Apr 15, 2025 | Offline RL, reinforcement-learning | Code Available | 3 |
| Towards Optimal Differentially Private Regret Bounds in Linear MDPs | Apr 12, 2025 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Decision SpikeFormer: Spike-Driven Transformer for Decision Making | Apr 4, 2025 | D4RL, Decision Making | Unverified | 0 |
| Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation | Mar 26, 2025 | D4RL, Data Augmentation | Unverified | 0 |
| Offline Reinforcement Learning with Discrete Diffusion Skills | Mar 26, 2025 | Decoder, Offline RL | Unverified | 0 |