SOTAVerified

Offline RL

Papers

Showing 211–220 of 755 papers

Title | Status | Hype
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning | Code | 0
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL | Code | 0
On the Effectiveness of Offline RL for Dialogue Response Generation | Code | 0
On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Code | 0
Off-policy Evaluation in Doubly Inhomogeneous Environments | Code | 0
Offline RL With Resource Constrained Online Deployment | Code | 0
Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood | Code | 0
Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Code | 0
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Code | 0
DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | - | Unverified
2 | ADMPO | Average Reward | 81 | - | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | - | Unverified