SOTAVerified

Offline RL

Papers

Showing 576600 of 755 papers

TitleStatusHype
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps0
A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies0
Bellman Residual Orthogonalization for Offline Reinforcement Learning0
Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning0
Semi-Markov Offline Reinforcement Learning for HealthcareCode0
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning AttacksCode0
Latent-Variable Advantage-Weighted Policy Optimization for Offline RLCode1
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning0
On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical EfficiencyCode0
Reliable validation of Reinforcement Learning Benchmarks0
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open ProblemsCode0
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity0
All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RLCode1
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement LearningCode1
VRL3: A Data-Driven Framework for Visual Deep Reinforcement LearningCode2
cosFormer: Rethinking Softmax in AttentionCode1
Supported Policy Optimization for Offline Reinforcement LearningCode1
Flowformer: Linearizing Transformers with Conservation FlowsCode2
Settling the Communication Complexity for Distributed Offline Reinforcement Learning0
Transferred Q-learning0
Offline Reinforcement Learning with Realizability and Single-policy Concentrability0
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RLCode1
Adversarially Trained Actor Critic for Offline Reinforcement LearningCode1
How to Leverage Unlabeled Data in Offline Reinforcement Learning0
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement LearningCode1
Show:102550
← PrevPage 24 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified