SOTAVerified

Offline RL

Papers

Showing 671680 of 755 papers

TitleStatusHype
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL0
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement LearningCode0
Offline Reinforcement Learning for Large Scale Language Action Spaces0
Reward Shifting for Optimistic Exploration and Conservative Exploitation0
Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers0
Should I Run Offline Reinforcement Learning or Behavioral Cloning?0
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning0
Targeted Environment Design from Offline Data0
The Essential Elements of Offline RL via Supervised Learning0
CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games0
Show:102550
← PrevPage 68 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified