SOTAVerified

Offline RL

Papers

Showing 701725 of 755 papers

TitleStatusHype
Two-step reinforcement learning for model-free redesign of nonlinear optimal regulatorCode0
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning0
Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning0
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning0
COMBO: Conservative Offline Model-Based Policy OptimizationCode1
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators0
Q-Value Weighted Regression: Reinforcement Learning with Limited DataCode0
Representation Matters: Offline Pretraining for Sequential Decision Making0
Near-Optimal Offline Reinforcement Learning via Double Variance Reduction0
NeoRL: A Near Real-World Benchmark for Offline Reinforcement LearningCode1
BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning0
Representation Balancing Offline Model-based Reinforcement Learning0
Uncertainty Weighted Offline Reinforcement Learning0
Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets0
Robust Offline Reinforcement Learning from Low-Quality Data0
Offline Policy Optimization with Variance Regularization0
Addressing Extrapolation Error in Deep Offline Reinforcement Learning0
Is Pessimism Provably Efficient for Offline RL?0
POPO: Pessimistic Offline Policy OptimizationCode0
Offline Reinforcement Learning from Images with Latent Space ModelsCode1
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
MOReL: Model-Based Offline Reinforcement Learning0
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement LearningCode0
Offline Reinforcement Learning Hands-On0
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning0
Show:102550
← PrevPage 29 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified