SOTAVerified

Offline RL

Papers

Showing 231240 of 755 papers

TitleStatusHype
Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data0
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL0
Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains0
What Matters for Batch Online Reinforcement Learning in Robotics?0
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach0
Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning0
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach0
Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study0
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning0
Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator0
Show:102550
← PrevPage 24 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified