SOTAVerified

Offline RL

Papers

Showing 151–175 of 755 papers

Title | Status | Hype
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning | Code | 1
Decision Transformer: Reinforcement Learning via Sequence Modeling | Code | 1
Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes | Code | 1
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning | Code | 1
Behavior Proximal Policy Optimization | Code | 1
An Optimistic Perspective on Offline Deep Reinforcement Learning | Code | 1
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Code | 1
Critic-Guided Decision Transformer for Offline Reinforcement Learning | Code | 1
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning | Code | 1
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL | Code | 1
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Code | 1
MADiff: Offline Multi-agent Learning with Diffusion Models | Code | 1
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets | Code | 1
MoCoDA: Model-based Counterfactual Data Augmentation | Code | 1
Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings | Code | 1
MOPO: Model-based Offline Policy Optimization | Code | 1
A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Code | 1
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization | Code | 1
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update | Code | 1
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL | Code | 1
cosFormer: Rethinking Softmax in Attention | Code | 1
Decoupled Prioritized Resampling for Offline RL | Code | 1
When should we prefer Decision Transformers for Offline Reinforcement Learning? | Code | 1
Offline Reinforcement Learning for Safer Blood Glucose Control in People with Type 1 Diabetes | Code | 1
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified