SOTAVerified

Offline RL

Papers

Showing 201210 of 755 papers

TitleStatusHype
Offline Reinforcement Learning from Datasets with Structured Non-StationarityCode0
Exclusively Penalized Q-learning for Offline Reinforcement Learning0
Offline RL via Feature-Occupancy Gradient Ascent0
Efficient Imitation Learning with Conservative World Models0
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?Code0
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses0
Reinformer: Max-Return Sequence Modeling for Offline RLCode1
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning0
Improving Offline Reinforcement Learning with Inaccurate Simulators0
LTLDoG: Satisfying Temporally-Extended Symbolic Constraints for Safe Diffusion-based PlanningCode1
Show:102550
← PrevPage 21 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified