SOTAVerified

Offline RL

Papers

Showing 591-600 of 755 papers

Title | Status | Hype
State Advantage Weighting for Offline RL | | 0
The Role of Coverage in Online Reinforcement Learning | | 0
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | | 0
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning | Code | 0
Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes | | 0
Can Offline Reinforcement Learning Help Natural Language Understanding? | | 0
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation | | 0
Task-Agnostic Learning to Accomplish New Tasks | | 0
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | | 0
Dialogue Evaluation with Offline Reinforcement Learning | | 0
Page 60 of 76

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified