SOTAVerified

Offline RL

Papers

Showing 251275 of 755 papers

TitleStatusHype
The Virtues of Pessimism in Inverse Reinforcement Learning0
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching0
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement LearningCode1
Adaptive Q-Aid for Conditional Supervised Learning in Offline Reinforcement Learning0
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient UpdateCode1
Context-Former: Stitching via Latent Conditioned Sequence Modeling0
Multi-Object Navigation in real environments using hybrid policies0
Differentiable Tree Search NetworkCode5
Solving Offline Reinforcement Learning with Decision Tree RegressionCode0
MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning0
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion ModelCode2
Harnessing Density Ratios for Online Reinforcement Learning0
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy LearningCode0
Learning from Sparse Offline Datasets via Conservative Density EstimationCode0
Solving Continual Offline Reinforcement Learning with Decision Transformer0
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization0
SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement LearningCode0
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning0
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond0
Policy-regularized Offline Multi-objective Reinforcement LearningCode0
POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement LearningCode0
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning0
Online Symbolic Music Alignment with Offline Reinforcement LearningCode1
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement LearningCode1
Critic-Guided Decision Transformer for Offline Reinforcement LearningCode1
Show:102550
← PrevPage 11 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified