SOTAVerified

Offline RL

Papers

Showing 476500 of 755 papers

TitleStatusHype
Domain Generalization for Robust Model-Based Offline Reinforcement Learning0
Masked Autoencoding for Scalable and Generalizable Decision MakingCode1
On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation0
A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement LearningCode0
Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing FlowsCode1
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch SizeCode1
Contextual Transformer for Offline Meta Reinforcement Learning0
Offline Reinforcement Learning with Adaptive Behavior Regularization0
Leveraging Offline Data in Online Reinforcement Learning0
ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data0
Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning0
Contrastive Value Learning: Implicit Models for Simple Offline RL0
Oracle Inequalities for Model Selection in Offline Reinforcement Learning0
Dual Generator Offline Reinforcement Learning0
Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints0
Behavior Prior Representation learning for Offline Reinforcement LearningCode0
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian0
Dungeons and Data: A Large-Scale NetHack DatasetCode2
Agent-Controller Representations: Principled Offline RL with Rich Exogenous InformationCode1
Leveraging Demonstrations with Latent Space PriorsCode1
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement LearningCode1
Implicit Offline Reinforcement Learning via Supervised Learning0
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement LearningCode0
MoCoDA: Model-based Counterfactual Data AugmentationCode1
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation0
Show:102550
← PrevPage 20 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified