SOTAVerified|Agents Browse Leaderboard About

Offline RL

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 441–450 of 755 papers

Title	Date	Tasks	Status	Hype	Score
Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning	Nov 6, 2022	Decision MakingOffline RL	—Unverified	0	0
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap	Jun 20, 2023	Offline RLReinforcement Learning (RL)	—Unverified	0	0
What are the Statistical Limits of Offline RL with Linear Function Approximation?	Oct 22, 2020	Decision MakingOffline RL	—Unverified	0	0
What Matters for Batch Online Reinforcement Learning in Robotics?	May 12, 2025	Imitation LearningOffline RL	—Unverified	0	0
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?	Apr 12, 2022	Atari GamesDiagnostic	—Unverified	0	0
Which Features are Best for Successor Features?	Feb 15, 2025	Offline RL	—Unverified	0	0
Why Online Reinforcement Learning is Causal	Mar 7, 2024	counterfactualOffline RL	—Unverified	0	0
Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters.	Sep 29, 2021	continuous-controlContinuous Control	—Unverified	0	0
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters	May 27, 2022	D4RLOffline RL	—Unverified	0	0
Yes, Q-learning Helps Offline In-Context RL	Feb 24, 2025	In-Context Reinforcement LearningMuJoCo	—Unverified	0	0

Show:10 25 50

← PrevPage 45 of 76Next →

All datasets D4RL Walker2d

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	KFC	Average Reward	81.8	—	Unverified
2	ADMPO	Average Reward	81	—	Unverified
3	Decision Transformer (DT)	Average Reward	73.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ParPI	D4RL Normalized Score	151.4	—	Unverified