SOTAVerified

Offline RL

Papers

Showing 131-140 of 755 papers

| Title | Status | Hype |
| --- | --- | --- |
| Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping | Code | 1 |
| Efficient Planning in a Compact Latent Action Space | Code | 1 |
| AdaCat: Adaptive Categorical Discretization for Autoregressive Models | Code | 1 |
| Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations | Code | 1 |
| When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning | Code | 1 |
| Behavior Transformers: Cloning k modes with one stone | Code | 1 |
| Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning | Code | 1 |
| RORL: Robust Offline Reinforcement Learning via Conservative Smoothing | Code | 1 |
| When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning | Code | 1 |
| RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning | Code | 1 |

Benchmark Results

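| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | KFC | Average Reward | 81.8 | - | Unverified |
| 2 | ADMPO | Average Reward | 81 | - | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | - | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ParPI | D4RL Normalized Score | 151.4 | - | Unverified |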