SOTAVerified

Offline RL

Papers

Showing 701–725 of 755 papers

| Title | Status | Hype |
|---|---|---|
| Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | | 0 |
| Offline Reinforcement Learning as Anti-Exploration | | 0 |
| Corruption-Robust Offline Reinforcement Learning | | 0 |
| Offline Inverse Reinforcement Learning | | 0 |
| Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning | | 0 |
| Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning | | 0 |
| Revisiting Design Choices in Offline Model Based Reinforcement Learning | | 0 |
| Model-Based Offline Planning with Trajectory Pruning | Code | 0 |
| Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | | 0 |
| Interpretable performance analysis towards offline reinforcement learning: A dataset perspective | | 0 |
| InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem | | 0 |
| Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism | | 0 |
| Regularized Behavior Value Estimation | | 0 |
| Offline Reinforcement Learning with Fisher Divergence Critic Regularization | | 0 |
| Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks | | 0 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | | 0 |
| Instabilities of Offline RL with Pre-Trained Neural Representation | | 0 |
| Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Code | 0 |
| DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | | 0 |
| Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning | | 0 |
| Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning | | 0 |
| PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators | | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Code | 0 |
| Representation Matters: Offline Pretraining for Sequential Decision Making | | 0 |

Benchmark Results

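| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |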