SOTAVerified

Offline RL

Papers

Showing 501525 of 755 papers

TitleStatusHype
Boosting Offline Reinforcement Learning via Data Rebalancing0
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data0
A Policy-Guided Imitation Approach for Offline Reinforcement LearningCode1
Mutual Information Regularized Offline Reinforcement LearningCode0
Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics BeliefCode0
Semi-Supervised Offline Reinforcement Learning with Action-Free TrajectoriesCode1
Efficient Offline Policy Optimization with a Learned ModelCode1
Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of TrialsCode1
Reliable Conditioning of Behavioral Cloning for Offline Reinforcement LearningCode1
The Role of Coverage in Online Reinforcement Learning0
State Advantage Weighting for Offline RL0
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning DatasetsCode1
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient0
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement LearningCode0
VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-TrainingCode1
Offline Reinforcement Learning via High-Fidelity Generative Behavior ModelingCode1
Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes0
Can Offline Reinforcement Learning Help Natural Language Understanding?0
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward ShapingCode1
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation0
Task-Agnostic Learning to Accomplish New Tasks0
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL0
Dialogue Evaluation with Offline Reinforcement Learning0
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments0
Efficient Planning in a Compact Latent Action SpaceCode1
Show:102550
← PrevPage 21 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified