SOTAVerified

Offline RL

Papers

Showing 451460 of 755 papers

TitleStatusHype
Skill Decision TransformerCode0
Direct Preference-based Policy Optimization without Reward ModelingCode1
Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation PoliciesCode0
Guiding Online Reinforcement Learning with Action-Free Offline PretrainingCode1
Learning to View: Decision Transformers for Active Object Detection0
Extreme Q-Learning: MaxEnt RL without EntropyCode1
Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives0
Benchmarks and Algorithms for Offline Preference-Based Reward Learning0
Offline Policy Optimization in RL with Variance Regularizaton0
Representation Learning in Deep RL via Discrete Information Bottleneck0
Show:102550
← PrevPage 46 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified