SOTAVerified

Offline RL

Papers

Showing 691700 of 755 papers

TitleStatusHype
Building Persona Consistent Dialogue Agents with Offline Reinforcement LearningCode0
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function ApproximationCode0
The Role of Deep Learning Regularizations on Actors in Offline RLCode0
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data CorruptionsCode0
Uncertainty-driven Trajectory Truncation for Data Augmentation in Offline Reinforcement LearningCode0
Optimality Inductive Biases and Agnostic Guidelines for Offline Reinforcement LearningCode0
PyTupli: A Scalable Infrastructure for Collaborative Offline Reinforcement Learning ProjectsCode0
Mutual Information Regularized Offline Reinforcement LearningCode0
Think-J: Learning to Think for Generative LLM-as-a-JudgeCode0
Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data CoverageCode0
Show:102550
← PrevPage 70 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified