SOTAVerified

Offline RL

Papers

Showing 741–750 of 755 papers

Title | Status | Hype
Offline Preference-Based Apprenticeship Learning |  | 0
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning |  | 0
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators |  | 0
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian |  | 0
Binary Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning |  | 0
Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL |  | 0
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization |  | 0
Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning |  | 0
Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning |  | 0
Oracle Inequalities for Model Selection in Offline Reinforcement Learning |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 |  | Unverified
2 | ADMPO | Average Reward | 81 |  | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 |  | Unverified