SOTAVerified

Offline RL

Papers

Showing 626-650 of 755 papers

| Title | Status | Hype |
| --- | --- | --- |
| Multi-Game Decision Transformers | Code | 0 |
| Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters | | 0 |
| Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes | | 0 |
| User-Interactive Offline Reinforcement Learning | | 0 |
| How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation | | 0 |
| Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning | | 0 |
| Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers | | 0 |
| Learning Value Functions from Undirected State-only Experience | | 0 |
| When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning? | | 0 |
| Settling the Sample Complexity of Model-Based Offline Reinforcement Learning | | 0 |
| A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies | | 0 |
| Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps | | 0 |
| Bellman Residual Orthogonalization for Offline Reinforcement Learning | | 0 |
| Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning | | 0 |
| Semi-Markov Offline Reinforcement Learning for Healthcare | Code | 0 |
| COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Code | 0 |
| DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | | 0 |
| On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Code | 0 |
| Reliable validation of Reinforcement Learning Benchmarks | | 0 |
| A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Code | 0 |
| Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | | 0 |
| Settling the Communication Complexity for Distributed Offline Reinforcement Learning | | 0 |
| Offline Reinforcement Learning with Realizability and Single-policy Concentrability | | 0 |
| Transferred Q-learning | | 0 |
| How to Leverage Unlabeled Data in Offline Reinforcement Learning | | 0 |

Benchmark Results

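| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |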