SOTAVerified

Offline RL

Papers

Showing 451475 of 755 papers

TitleStatusHype
Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning0
Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly0
Generative Probabilistic Planning for Optimizing Supply Chain Networks0
GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning0
Goal-Conditioned Data Augmentation for Offline Reinforcement Learning0
Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via Metric Learning0
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning0
Graph Decision Transformer0
GriddlyJS: A Web IDE for Reinforcement Learning0
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning0
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps0
Harnessing Density Ratios for Online Reinforcement Learning0
H-GAP: Humanoid Control with a Generalist Planner0
How to Leverage Unlabeled Data in Offline Reinforcement Learning0
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation0
Human-centric Dialog Training via Offline Reinforcement Learning0
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance0
Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier0
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs0
Hyperparameter Selection for Offline Reinforcement Learning0
Implicit Offline Reinforcement Learning via Supervised Learning0
Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning0
Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback0
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning0
Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization0
Show:102550
← PrevPage 19 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified