SOTAVerified

Offline RL

Papers

Showing 541550 of 755 papers

TitleStatusHype
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement LearningCode0
The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning0
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation0
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function ApproximationCode0
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications0
Language Decision Transformers with Exponential Tilt for Interactive Text Environments0
A Strong Baseline for Batch Imitation Learning0
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage0
Selective Uncertainty Propagation in Offline RL0
Revisiting Bellman Errors for Offline Model SelectionCode0
Show:102550
← PrevPage 55 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified