SOTAVerified

Offline RL

Papers

Showing 376400 of 755 papers

TitleStatusHype
Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers0
SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets0
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration0
Settling the Communication Complexity for Distributed Offline Reinforcement Learning0
Settling the Sample Complexity of Model-Based Offline Reinforcement Learning0
Should I Run Offline Reinforcement Learning or Behavioral Cloning?0
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters0
Single-Shot Pruning for Offline Reinforcement Learning0
Data-Incremental Continual Offline Reinforcement Learning0
Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning0
SLiC-HF: Sequence Likelihood Calibration with Human Feedback0
Solving Continual Offline Reinforcement Learning with Decision Transformer0
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces0
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning0
SR-Reward: Taking The Path More Traveled0
State Advantage Weighting for Offline RL0
State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning0
State Regularized Policy Optimization on Data with Dynamics Shift0
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments0
Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC0
Striving for Simplicity in Off-Policy Deep Reinforcement Learning0
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning0
Survival Instinct in Offline Reinforcement Learning0
Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning0
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach0
Show:102550
← PrevPage 16 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified