SOTAVerified

Offline RL

Papers

Showing 276-300 of 755 papers

Title | Status | Hype
Mutual Information Regularized Offline Reinforcement Learning | Code | 0
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Code | 0
Model-based Offline Reinforcement Learning with Count-based Conservatism | Code | 0
Model-based Offline Policy Optimization with Adversarial Network | Code | 0
d3rlpy: An Offline Deep Reinforcement Learning Library | Code | 0
Model-Based Offline Planning with Trajectory Pruning | Code | 0
Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Code | 0
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator | Code | 0
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Code | 0
Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Code | 0
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Code | 0
A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning | Code | 0
Learning Versatile Skills with Curriculum Masking | Code | 0
Learning to Reach Goals via Diffusion | Code | 0
Behavior Prior Representation learning for Offline Reinforcement Learning | Code | 0
Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL | Code | 0
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning | Code | 0
Corruption-Robust Offline Reinforcement Learning with General Function Approximation | Code | 0
Learning from Sparse Offline Datasets via Conservative Density Estimation | Code | 0
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning? | Code | 0
Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning | Code | 0
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | Code | 0
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement Learning | Code | 0
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning | Code | 0
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified