SOTAVerified

Offline RL

Papers

Showing 226250 of 755 papers

TitleStatusHype
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation0
Exclusively Penalized Q-learning for Offline Reinforcement Learning0
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs0
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles0
Confidence-Conditioned Value Functions for Offline Reinforcement Learning0
Enhancing Reinforcement Learning Through Guided Search0
Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits0
A Validation Tool for Designing Reinforcement Learning Environments0
Automatic Trade-off Adaptation in Offline RL0
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning0
Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective0
Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention0
Federated Offline Reinforcement Learning0
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices0
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching0
Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting0
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions0
Finetuning Offline World Models in the Real World0
Enhanced DACER Algorithm with High Diffusion Efficiency0
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning0
Energy-Weighted Flow Matching for Offline Reinforcement Learning0
Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning0
Show:102550
← PrevPage 10 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified