| Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data | May 14, 2025 | Offline RLreinforcement-learning | —Unverified | 0 |
| Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL | May 13, 2025 | Offline RLSafe Reinforcement Learning | —Unverified | 0 |
| Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | May 12, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| What Matters for Batch Online Reinforcement Learning in Robotics? | May 12, 2025 | Imitation LearningOffline RL | —Unverified | 0 |
| Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach | May 10, 2025 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning | May 9, 2025 | D4RLOffline RL | —Unverified | 0 |
| Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach | May 8, 2025 | D4RLDecision Making | —Unverified | 0 |
| Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study | May 4, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning | May 3, 2025 | D4RLOffline RL | —Unverified | 0 |
| Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator | Apr 23, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |