SOTAVerified

Offline RL

Papers

Showing 701-725 of 755 papers

Title | Status | Hype
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Code | 0
Skill Decision Transformer | Code | 0
Multi-Game Decision Transformers | Code | 0
Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Code | 0
MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Code | 0
Corruption-Robust Offline Reinforcement Learning with General Function Approximation | Code | 0
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Code | 0
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? | Code | 0
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Code | 0
Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Code | 0
SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning | Code | 0
Model-based Offline Reinforcement Learning with Count-based Conservatism | Code | 0
Solving Offline Reinforcement Learning with Decision Tree Regression | Code | 0
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Code | 0
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Code | 0
Beyond Reward: Offline Preference-guided Policy Optimization | Code | 0
SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Code | 0
Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Code | 0
Model-based Offline Policy Optimization with Adversarial Network | Code | 0
Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Code | 0
Stabilizing Extreme Q-learning by Maclaurin Expansion | Code | 0
Model-Based Offline Planning with Trajectory Pruning | Code | 0
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination | Code | 0
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Code | 0
Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified