SOTAVerified

Offline RL

Papers

Showing 426450 of 755 papers

TitleStatusHype
Advancing RAN Slicing with Offline Reinforcement Learning0
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning0
Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization0
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman OperatorCode0
Diffused Task-Agnostic Milestone Planner0
Evaluation of Active Feature Acquisition Methods for Static Feature Settings0
H-GAP: Humanoid Control with a Generalist Planner0
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective0
Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning0
A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning0
Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets0
Offline Data Enhanced On-Policy Policy Gradient with Provable GuaranteesCode0
Rethinking Decision Transformer via Hierarchical Reinforcement Learning0
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity0
Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving0
A Tractable Inference Perspective of Offline RL0
Robust Offline Reinforcement learning with Heavy-Tailed RewardsCode0
Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data CoverageCode0
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning0
Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation0
Finetuning Offline World Models in the Real World0
Corruption-Robust Offline Reinforcement Learning with General Function ApproximationCode0
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning0
Building Persona Consistent Dialogue Agents with Offline Reinforcement LearningCode0
End-to-end Offline Reinforcement Learning for Glycemia Control0
Show:102550
← PrevPage 18 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified