SOTAVerified

Offline RL

Papers

Showing 551–575 of 755 papers

| Title | Status | Hype |
|---|---|---|
| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | | 0 |
| Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach | | 0 |
| Targeted Environment Design from Offline Data | | 0 |
| The Challenges of Exploration for Offline Reinforcement Learning | | 0 |
| The Essential Elements of Offline RL via Supervised Learning | | 0 |
| The Least Restriction for Offline Reinforcement Learning | | 0 |
| The Pitfalls of Imitation Learning when Actions are Continuous | | 0 |
| The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning | | 0 |
| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | | 0 |
| The Role of Coverage in Online Reinforcement Learning | | 0 |
| The Role of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation | | 0 |
| The Value of Reward Lookahead in Reinforcement Learning | | 0 |
| The Virtues of Pessimism in Inverse Reinforcement Learning | | 0 |
| To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning | | 0 |
| Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers | | 0 |
| Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers | | 0 |
| Towards Generalizable Reinforcement Learning for Trade Execution | | 0 |
| Towards Instance-Optimal Offline Reinforcement Learning with Pessimism | | 0 |
| Towards Optimal Differentially Private Regret Bounds in Linear MDPs | | 0 |
| Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning | | 0 |
| Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | | 0 |
| Tractable Offline Learning of Regular Decision Processes | | 0 |
| Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear q^π-Realizability and Concentrability | | 0 |
| Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding | | 0 |
| Transferred Q-learning | | 0 |

Benchmark Results

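| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |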