SOTAVerified

Offline RL

Papers

Showing 7180 of 755 papers

TitleStatusHype
Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning0
Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback0
Data Center Cooling System Optimization Using Offline Reinforcement Learning0
Fat-to-Thin Policy Optimization: Offline RL with Sparse PoliciesCode0
Large Language Model driven Policy Exploration for Recommender Systems0
DRDT3: Diffusion-Refined Decision Test-Time Training Model0
SR-Reward: Taking The Path More Traveled0
On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures0
Goal-Conditioned Data Augmentation for Offline Reinforcement Learning0
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RLCode0
Show:102550
← PrevPage 8 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified