SOTAVerified

Offline RL

Papers

Showing 726750 of 755 papers

TitleStatusHype
Contrastive Example-Based ControlCode0
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and SmoothnessCode0
Diffusion Models as Optimizers for Efficient Planning in Offline RLCode0
Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMsCode0
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement LearningCode0
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman OperatorCode0
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy LearningCode0
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from ObservationsCode0
Revisiting Bellman Errors for Offline Model SelectionCode0
Unified Off-Policy Learning to Rank: a Reinforcement Learning PerspectiveCode0
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement LearningCode0
Continual Task Learning through Adaptive Policy Self-CompositionCode0
Learning Versatile Skills with Curriculum MaskingCode0
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPsCode0
Behavior Prior Representation learning for Offline Reinforcement LearningCode0
Compositional Conservatism: A Transductive Approach in Offline Reinforcement LearningCode0
Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RLCode0
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement LearningCode0
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement LearningCode0
Decision Transformer under Random Frame DroppingCode0
Learning to Reach Goals via DiffusionCode0
The CoSTAR Block Stacking Dataset: Learning with Workspace ConstraintsCode0
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RLCode0
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning AgentsCode0
Robust Offline Reinforcement learning with Heavy-Tailed RewardsCode0
Show:102550
← PrevPage 30 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified