SOTAVerified

Offline RL

Papers

Showing 271280 of 755 papers

TitleStatusHype
POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement LearningCode0
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning0
Online Symbolic Music Alignment with Offline Reinforcement LearningCode1
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement LearningCode1
Critic-Guided Decision Transformer for Offline Reinforcement LearningCode1
Neural Network Approximation for Pessimistic Offline Reinforcement Learning0
CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning0
Advancing RAN Slicing with Offline Reinforcement Learning0
Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL ApproachCode1
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning0
Show:102550
← PrevPage 28 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified