SOTAVerified

Offline RL

Papers

Showing 661670 of 755 papers

TitleStatusHype
On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical EfficiencyCode0
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained OptimizationCode0
Off-policy Evaluation in Doubly Inhomogeneous EnvironmentsCode0
Offline RL with Smooth OOD Generalization in Convex Hull and its NeighborhoodCode0
Offline RL With Resource Constrained Online DeploymentCode0
Scalable Decision-Making in Stochastic Environments through Learned Temporal AbstractionCode0
POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement LearningCode0
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?Code0
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy OptimizationCode0
Policy Constraint by Only Support Constraint for Offline Reinforcement LearningCode0
Show:102550
← PrevPage 67 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified