SOTAVerified

Offline RL

Papers

Showing 701750 of 755 papers

TitleStatusHype
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL0
On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning0
Offline Reinforcement Learning as Anti-Exploration0
Corruption-Robust Offline Reinforcement Learning0
Offline Inverse Reinforcement Learning0
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning0
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning0
Revisiting Design Choices in Offline Model Based Reinforcement Learning0
Model-Based Offline Planning with Trajectory PruningCode0
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings0
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective0
InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem0
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism0
Regularized Behavior Value Estimation0
Offline Reinforcement Learning with Fisher Divergence Critic Regularization0
Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks0
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning0
Instabilities of Offline RL with Pre-Trained Neural Representation0
Two-step reinforcement learning for model-free redesign of nonlinear optimal regulatorCode0
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning0
Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning0
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning0
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators0
Q-Value Weighted Regression: Reinforcement Learning with Limited DataCode0
Representation Matters: Offline Pretraining for Sequential Decision Making0
Near-Optimal Offline Reinforcement Learning via Double Variance Reduction0
Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets0
Offline Policy Optimization with Variance Regularization0
Uncertainty Weighted Offline Reinforcement Learning0
Robust Offline Reinforcement Learning from Low-Quality Data0
Representation Balancing Offline Model-based Reinforcement Learning0
Addressing Extrapolation Error in Deep Offline Reinforcement Learning0
BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning0
Is Pessimism Provably Efficient for Offline RL?0
POPO: Pessimistic Offline Policy OptimizationCode0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement LearningCode0
MOReL: Model-Based Offline Reinforcement Learning0
Offline Reinforcement Learning Hands-On0
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning0
What are the Statistical Limits of Offline RL with Linear Function Approximation?0
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPsCode0
Learning Dexterous Manipulation from Suboptimal Experts0
Human-centric Dialog Training via Offline Reinforcement Learning0
The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line0
Model-Based Offline Planning0
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning0
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL0
Hyperparameter Selection for Offline Reinforcement Learning0
Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning0
Show:102550
← PrevPage 15 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified