SOTAVerified

Offline RL

Papers

Showing 511-520 of 755 papers

Title | Status | Hype
State Advantage Weighting for Offline RL | - | 0
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets | Code | 1
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | - | 0
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning | Code | 0
VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training | Code | 1
Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling | Code | 1
Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes | - | 0
Can Offline Reinforcement Learning Help Natural Language Understanding? | - | 0
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping | Code | 1
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation | - | 0
Page 52 of 76

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | - | Unverified
2 | ADMPO | Average Reward | 81 | - | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | - | Unverified