SOTAVerified

Offline RL

Papers

Showing 426450 of 755 papers

TitleStatusHype
Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective0
Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits0
Enhancing Reinforcement Learning Through Guided Search0
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles0
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning0
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning0
Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning0
Equivariant Offline Reinforcement Learning0
Evaluation of Active Feature Acquisition Methods for Static Feature Settings0
Evaluation-Time Policy Switching for Offline Reinforcement Learning0
Exclusively Penalized Q-learning for Offline Reinforcement Learning0
Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations0
Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study0
A Tractable Inference Perspective of Offline RL0
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL0
Federated Offline Reinforcement Learning0
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices0
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching0
Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting0
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions0
Finetuning Offline World Models in the Real World0
Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback0
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning0
FOSP: Fine-tuning Offline Safe Policy through World Models0
From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning0
Show:102550
← PrevPage 18 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified