SOTAVerified

Offline RL

Papers

Showing 461470 of 755 papers

TitleStatusHype
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps0
Harnessing Density Ratios for Online Reinforcement Learning0
H-GAP: Humanoid Control with a Generalist Planner0
How to Leverage Unlabeled Data in Offline Reinforcement Learning0
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation0
Human-centric Dialog Training via Offline Reinforcement Learning0
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance0
Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier0
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs0
Hyperparameter Selection for Offline Reinforcement Learning0
Show:102550
← PrevPage 47 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified