SOTAVerified

Offline RL

Papers

Showing 151175 of 755 papers

TitleStatusHype
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning0
A Simulation Benchmark for Autonomous Racing with Large-Scale Human DataCode2
Diffusion Models as Optimizers for Efficient Planning in Offline RLCode0
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender SystemsCode0
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning0
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning0
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning0
FOSP: Fine-tuning Offline Safe Policy through World Models0
Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling0
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning0
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators0
Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning0
Preference Elicitation for Offline Reinforcement Learning0
Equivariant Offline Reinforcement Learning0
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing0
Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback0
The Role of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation0
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement LearningCode3
Binary Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning0
SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets0
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning0
Is Value Learning Really the Main Bottleneck in Offline RL?Code3
A Dual Approach to Imitation Learning from Observations with Offline Datasets0
Augmenting Offline RL with Unlabeled Data0
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning0
Show:102550
← PrevPage 7 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified