SOTAVerified

Offline RL

Papers

Showing 441450 of 755 papers

TitleStatusHype
Neural Laplace Control for Continuous-time Delayed SystemsCode1
Behavior Proximal Policy OptimizationCode1
Swapped goal-conditioned offline reinforcement learningCode1
Dual RL: Unification and New Methods for Reinforcement and Imitation LearningCode1
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications0
Language Decision Transformers with Exponential Tilt for Interactive Text Environments0
A Strong Baseline for Batch Imitation Learning0
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage0
Selective Uncertainty Propagation in Offline RL0
Revisiting Bellman Errors for Offline Model SelectionCode0
Show:102550
← PrevPage 45 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified