SOTAVerified

Offline RL

Papers

Showing 701–750 of 755 papers

Title | Status | Hype
Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Code | 0
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | - | 0
Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning | - | 0
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning | - | 0
COMBO: Conservative Offline Model-Based Policy Optimization | Code | 1
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators | - | 0
Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Code | 0
Representation Matters: Offline Pretraining for Sequential Decision Making | - | 0
Near-Optimal Offline Reinforcement Learning via Double Variance Reduction | - | 0
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning | Code | 1
BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning | - | 0
Representation Balancing Offline Model-based Reinforcement Learning | - | 0
Uncertainty Weighted Offline Reinforcement Learning | - | 0
Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | - | 0
Robust Offline Reinforcement Learning from Low-Quality Data | - | 0
Offline Policy Optimization with Variance Regularization | - | 0
Addressing Extrapolation Error in Deep Offline Reinforcement Learning | - | 0
Is Pessimism Provably Efficient for Offline RL? | - | 0
POPO: Pessimistic Offline Policy Optimization | Code | 0
Offline Reinforcement Learning from Images with Latent Space Models | Code | 1
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | - | 0
MOReL: Model-Based Offline Reinforcement Learning | - | 0
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning | Code | 0
Offline Reinforcement Learning Hands-On | - | 0
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning | - | 0
What are the Statistical Limits of Offline RL with Linear Function Approximation? | - | 0
Batch Exploration with Examples for Scalable Robotic Reinforcement Learning | Code | 1
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs | Code | 0
Learning Dexterous Manipulation from Suboptimal Experts | - | 0
Human-centric Dialog Training via Offline Reinforcement Learning | - | 0
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization | Code | 1
Rethinking Attention with Performers | Code | 2
The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | - | 0
Offline Meta-Reinforcement Learning with Advantage Weighting | Code | 1
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning | - | 0
Model-Based Offline Planning | - | 0
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | - | 0
Hyperparameter Selection for Offline Reinforcement Learning | - | 0
Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning | - | 0
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention | Code | 1
Critic Regularized Regression | Code | 1
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning | Code | 0
Conservative Q-Learning for Offline Reinforcement Learning | Code | 1
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Code | 1
Acme: A Research Framework for Distributed Reinforcement Learning | Code | 1
MOPO: Model-based Offline Policy Optimization | Code | 1
MOReL : Model-Based Offline Reinforcement Learning | Code | 1
D4RL: Datasets for Deep Data-Driven Reinforcement Learning | Code | 2
Reformer: The Efficient Transformer | Code | 2
An Optimistic Perspective on Offline Deep Reinforcement Learning | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | - | Unverified
2 | ADMPO | Average Reward | 81 | - | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | - | Unverified