SOTAVerified

Offline RL

Papers

Showing 321–330 of 755 papers

| Title | Status | Hype |
|---|---|---|
| Context-Former: Stitching via Latent Conditioned Sequence Modeling | | 0 |
| Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | | 0 |
| AdaCred: Adaptive Causal Decision Transformers with Feature Crediting | | 0 |
| A Tractable Inference Perspective of Offline RL | | 0 |
| Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study | | 0 |
| Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations | | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | | 0 |
| Exclusively Penalized Q-learning for Offline Reinforcement Learning | | 0 |
| Evaluation-Time Policy Switching for Offline Reinforcement Learning | | 0 |
| Evaluation of Active Feature Acquisition Methods for Static Feature Settings | | 0 |

Benchmark Results

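| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |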