SOTAVerified

Offline RL

Papers

Showing 176-200 of 755 papers

Title | Status | Hype
Are Expressive Models Truly Necessary for Offline RL? | Code | 1
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation | Code | 1
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning | Code | 1
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets | Code | 1
MoCoDA: Model-based Counterfactual Data Augmentation | Code | 1
Improving and Benchmarking Offline Reinforcement Learning Algorithms | Code | 1
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning | Code | 1
Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning | Code | 1
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations | Code | 1
When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning | Code | 1
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Code | 1
PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer | Code | 1
Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings | Code | 1
DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors | Code | 1
MOPO: Model-based Offline Policy Optimization | Code | 1
Q-value Regularized Transformer for Offline Reinforcement Learning | Code | 1
Neural Laplace Control for Continuous-time Delayed Systems | Code | 1
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning | Code | 1
Adversarially Trained Actor Critic for Offline Reinforcement Learning | Code | 1
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL | Code | 1
Doubly Mild Generalization for Offline Reinforcement Learning | Code | 1
Offline Meta-Reinforcement Learning with Advantage Weighting | Code | 1
Supported Policy Optimization for Offline Reinforcement Learning | Code | 1
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Code | 0
On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified