SOTAVerified

Offline RL

Papers

Showing 651660 of 755 papers

TitleStatusHype
Representation Balancing Offline Model-based Reinforcement Learning0
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RLCode0
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender SystemsCode0
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement LearningCode0
Learning from Sparse Offline Datasets via Conservative Density EstimationCode0
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement LearningCode0
On the Effectiveness of Offline RL for Dialogue Response GenerationCode0
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement LearningCode0
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement LearningCode0
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?Code0
Show:102550
← PrevPage 66 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified