SOTAVerified

Offline RL

Papers

Showing 451-475 of 755 papers

Title | Status | Hype
Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments | | 0
Bi-Level Offline Policy Optimization with Limited Exploration | | 0
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning | | 0
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning | Code | 0
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration | | 0
Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning | | 0
Learning to Reach Goals via Diffusion | Code | 0
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning | | 0
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness | Code | 0
Uncertainty-Aware Decision Transformer for Stochastic Driving Environments | | 0
Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills | | 0
Robotic Offline RL from Internet Videos via Value-Function Pre-Training | | 0
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps | | 0
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions | | 0
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning | | 0
Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning | | 0
Model-based Offline Policy Optimization with Adversarial Network | Code | 0
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance | | 0
Multi-Objective Decision Transformers for Offline Reinforcement Learning | | 0
Reinforced Self-Training (ReST) for Language Modeling | | 0
Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World | | 0
Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations | | 0
Integrating Offline Reinforcement Learning with Transformers for Sequential Recommendation | | 0
Contrastive Example-Based Control | Code | 0
A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning | Code | 0
Page 19 of 31

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified