SOTAVerified

Offline RL

Papers

Showing 171180 of 755 papers

TitleStatusHype
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree SearchCode1
Latent-Variable Advantage-Weighted Policy Optimization for Offline RLCode1
cosFormer: Rethinking Softmax in AttentionCode1
Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of TrialsCode1
When should we prefer Decision Transformers for Offline Reinforcement Learning?Code1
Are Expressive Models Truly Necessary for Offline RL?Code1
Direct Preference-based Policy Optimization without Reward ModelingCode1
Diffusion Policies creating a Trust Region for Offline Reinforcement LearningCode1
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction EstimationCode1
Efficient Diffusion Policies for Offline Reinforcement LearningCode1
Show:102550
← PrevPage 18 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified