SOTAVerified

Offline RL

Papers

Showing 671680 of 755 papers

TitleStatusHype
DCUR: Data Curriculum for Teaching via Samples with Reinforcement LearningCode0
Fat-to-Thin Policy Optimization: Offline RL with Sparse PoliciesCode0
Explaining RL Decisions with TrajectoriesCode0
Experimental evaluation of offline reinforcement learning for HVAC control in buildingsCode0
Offline Reinforcement Learning from Datasets with Structured Non-StationarityCode0
Policy-regularized Offline Multi-objective Reinforcement LearningCode0
POPO: Pessimistic Offline Policy OptimizationCode0
d3rlpy: An Offline Deep Reinforcement Learning LibraryCode0
Preference-Guided Reflective Sampling for Aligning Language ModelsCode0
MOBODY: Model Based Off-Dynamics Offline Reinforcement LearningCode0
Show:102550
← PrevPage 68 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified