SOTAVerified

Offline RL

Papers

Showing 421-430 of 755 papers

Title | Status | Hype
Policy-regularized Offline Multi-objective Reinforcement Learning | Code | 0
POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning | Code | 0
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning | | 0
Neural Network Approximation for Pessimistic Offline Reinforcement Learning | | 0
CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning | | 0
Advancing RAN Slicing with Offline Reinforcement Learning | | 0
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | | 0
Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization | | 0
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator | Code | 0
Diffused Task-Agnostic Milestone Planner | | 0
Page 43 of 76

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified