SOTAVerified

Offline RL

Papers

Showing 376400 of 755 papers

TitleStatusHype
Policy Regularization with Dataset Constraint for Offline Reinforcement LearningCode1
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning0
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning0
Decoupled Prioritized Resampling for Offline RLCode1
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RLCode1
Mildly Constrained Evaluation Policy for Offline Reinforcement LearningCode0
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation0
State Regularized Policy Optimization on Data with Dynamics Shift0
Survival Instinct in Offline Reinforcement Learning0
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding0
Improving and Benchmarking Offline Reinforcement Learning AlgorithmsCode1
Improving Offline RL by Blending Heuristics0
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control0
Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning0
Efficient Diffusion Policies for Offline Reinforcement LearningCode1
Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal RepresentationCode1
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?Code0
Robust Reinforcement Learning Objectives for Sequential Recommender SystemsCode0
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism0
MADiff: Offline Multi-agent Learning with Diffusion ModelsCode1
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement LearningCode0
Beyond Reward: Offline Preference-guided Policy OptimizationCode0
Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement LearningCode1
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language ModelsCode1
When should we prefer Decision Transformers for Offline Reinforcement Learning?Code1
Show:102550
← PrevPage 16 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified