SOTAVerified

Offline RL

Papers

Showing 151175 of 755 papers

TitleStatusHype
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement LearningCode1
Decision Transformer: Reinforcement Learning via Sequence ModelingCode1
Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse ShapesCode1
Critic Regularized RegressionCode1
Behavior Proximal Policy OptimizationCode1
An Optimistic Perspective on Offline Deep Reinforcement LearningCode1
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced DatasetsCode1
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward ShapingCode1
Critic-Guided Decision Transformer for Offline Reinforcement LearningCode1
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement LearningCode1
Deployment-Efficient Reinforcement Learning via Model-Based Offline OptimizationCode1
Offline Reinforcement Learning via High-Fidelity Generative Behavior ModelingCode1
Guiding Online Reinforcement Learning with Action-Free Offline PretrainingCode1
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement LearningCode1
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory WeightingCode1
cosFormer: Rethinking Softmax in AttentionCode1
A Policy-Guided Imitation Approach for Offline Reinforcement LearningCode1
When should we prefer Decision Transformers for Offline Reinforcement Learning?Code1
Dual RL: Unification and New Methods for Reinforcement and Imitation LearningCode1
Improving and Benchmarking Offline Reinforcement Learning AlgorithmsCode1
Optimal Transport for Offline Imitation LearningCode1
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement LearningCode1
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction EstimationCode1
Offline Reinforcement Learning from Images with Latent Space ModelsCode1
Offline Reinforcement Learning with Implicit Q-LearningCode1
Show:102550
← PrevPage 7 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified