SOTAVerified

Offline RL

Papers

Showing 241250 of 755 papers

TitleStatusHype
Measurement Scheduling for ICU Patients with Offline Reinforcement Learning0
Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RLCode1
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning0
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices0
Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning0
Offline Actor-Critic Reinforcement Learning Scales to Large Models0
A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs0
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement LearningCode1
SEABO: A Simple Search-Based Method for Offline Imitation LearningCode1
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning0
Show:102550
← PrevPage 25 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified