SOTAVerified

Offline RL

Papers

Showing 151200 of 755 papers

TitleStatusHype
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement LearningCode1
Can Wikipedia Help Offline Reinforcement Learning?Code1
RvS: What is Essential for Offline RL via Supervised Learning?Code1
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC TasksCode1
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor RectificationCode1
A Dataset Perspective on Offline Reinforcement LearningCode1
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement LearningCode1
Curriculum Offline Imitation LearningCode1
False Correlation Reduction for Offline Reinforcement LearningCode1
Offline Reinforcement Learning with Value-based Episodic MemoryCode1
Safe Driving via Expert Guided Policy OptimizationCode1
Planning from Pixels in Environments with Combinatorially Hard Search SpacesCode1
Offline Reinforcement Learning with Implicit Q-LearningCode1
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement LearningCode1
Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse ShapesCode1
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-EnsembleCode1
Offline Reinforcement Learning with Reverse Model-based ImaginationCode1
Offline Reinforcement Learning with In-sample Q-LearningCode1
A Workflow for Offline Model-Free Robotic Reinforcement LearningCode1
Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare SettingsCode1
Conservative Offline Distributional Reinforcement LearningCode1
Offline Meta-Reinforcement Learning with Online Self-SupervisionCode1
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-EnsembleCode1
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction EstimationCode1
Offline RL Without Off-Policy EvaluationCode1
Reinforcement Learning as One Big Sequence Modeling ProblemCode1
A Minimalist Approach to Offline Reinforcement LearningCode1
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement LearningCode1
Online reinforcement learning with sparse rewards through an active inference capsuleCode1
Offline Reinforcement Learning as One Big Sequence Modeling ProblemCode1
Decision Transformer: Reinforcement Learning via Sequence ModelingCode1
Uncertainty Weighted Actor-Critic for Offline Reinforcement LearningCode1
Online and Offline Reinforcement Learning by Planning with a Learned ModelCode1
COMBO: Conservative Offline Model-Based Policy OptimizationCode1
NeoRL: A Near Real-World Benchmark for Offline Reinforcement LearningCode1
Offline Reinforcement Learning from Images with Latent Space ModelsCode1
Batch Exploration with Examples for Scalable Robotic Reinforcement LearningCode1
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior RegularizationCode1
Offline Meta-Reinforcement Learning with Advantage WeightingCode1
Transformers are RNNs: Fast Autoregressive Transformers with Linear AttentionCode1
Critic Regularized RegressionCode1
Conservative Q-Learning for Offline Reinforcement LearningCode1
Deployment-Efficient Reinforcement Learning via Model-Based Offline OptimizationCode1
Acme: A Research Framework for Distributed Reinforcement LearningCode1
MOPO: Model-based Offline Policy OptimizationCode1
MOReL : Model-Based Offline Reinforcement LearningCode1
An Optimistic Perspective on Offline Deep Reinforcement LearningCode1
An Optimistic Perspective on Offline Reinforcement LearningCode1
From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning0
Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMsCode0
Show:102550
← PrevPage 4 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified