SOTAVerified

Offline RL

Papers

Showing 91100 of 755 papers

TitleStatusHype
Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting0
Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback0
Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic PerspectiveCode2
Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization0
PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement0
Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative TradingCode2
LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble0
Preserving Expert-Level Privacy in Offline Reinforcement Learning0
Continual Task Learning through Adaptive Policy Self-CompositionCode0
Doubly Mild Generalization for Offline Reinforcement LearningCode1
Show:102550
← PrevPage 10 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified