SOTAVerified

Offline RL

Papers

Showing 326350 of 755 papers

TitleStatusHype
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement LearningCode1
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps0
Robotic Offline RL from Internet Videos via Value-Function Pre-Training0
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions0
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning0
Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning0
VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement LearningCode1
Reasoning with Latent Diffusion in Offline Reinforcement LearningCode1
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement LearningCode1
Model-based Offline Policy Optimization with Adversarial NetworkCode0
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance0
Multi-Objective Decision Transformers for Offline Reinforcement Learning0
Reinforced Self-Training (ReST) for Language Modeling0
Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World0
Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations0
AlphaStar Unplugged: Large-Scale Offline Reinforcement LearningCode2
Integrating Offline Reinforcement Learning with Transformers for Sequential Recommendation0
Contrastive Example-Based ControlCode0
A Connection between One-Step Regularization and Critic Regularization in Reinforcement LearningCode0
On the Effectiveness of Offline RL for Dialogue Response GenerationCode0
Model-based Offline Reinforcement Learning with Count-based ConservatismCode0
PASTA: Pretrained Action-State Transformer Agents0
Towards Self-Assembling Artificial Neural Networks through Neural Developmental ProgramsCode1
Robotic Manipulation Datasets for Offline Compositional Reinforcement LearningCode1
Budgeting Counterfactual for Offline RL0
Show:102550
← PrevPage 14 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified