SOTAVerified

Offline RL

Papers

Showing 501550 of 755 papers

TitleStatusHype
Boosting Offline Reinforcement Learning via Data Rebalancing0
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data0
A Policy-Guided Imitation Approach for Offline Reinforcement LearningCode1
Mutual Information Regularized Offline Reinforcement LearningCode0
Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics BeliefCode0
Semi-Supervised Offline Reinforcement Learning with Action-Free TrajectoriesCode1
Efficient Offline Policy Optimization with a Learned ModelCode1
Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of TrialsCode1
Reliable Conditioning of Behavioral Cloning for Offline Reinforcement LearningCode1
The Role of Coverage in Online Reinforcement Learning0
State Advantage Weighting for Offline RL0
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning DatasetsCode1
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient0
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement LearningCode0
VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-TrainingCode1
Offline Reinforcement Learning via High-Fidelity Generative Behavior ModelingCode1
Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes0
Can Offline Reinforcement Learning Help Natural Language Understanding?0
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward ShapingCode1
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation0
Task-Agnostic Learning to Accomplish New Tasks0
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL0
Dialogue Evaluation with Offline Reinforcement Learning0
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments0
Efficient Planning in a Compact Latent Action SpaceCode1
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement LearningCode2
Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity0
AdaCat: Adaptive Categorical Discretization for Autoregressive ModelsCode1
Offline Reinforcement Learning at Multiple Frequencies0
Discriminator-Weighted Offline Imitation Learning from Suboptimal DemonstrationsCode1
BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion0
GriddlyJS: A Web IDE for Reinforcement Learning0
Offline Equilibrium FindingCode0
Offline RL Policies Should be Trained to be Adaptive0
An Empirical Study of Implicit Regularization in Deep Offline RL0
Prompting Decision Transformer for Few-Shot Policy Generalization0
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement LearningCode1
Behavior Transformers: Cloning k modes with one stoneCode1
A Survey on Model-based Reinforcement Learning0
Bootstrapped Transformer for Offline Reinforcement Learning0
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement LearningCode2
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based ImaginationCode0
Contrastive Learning as Goal-Conditioned Reinforcement Learning0
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement LearningCode0
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward0
Provable Benefit of Multitask Representation Learning in Reinforcement Learning0
Federated Offline Reinforcement Learning0
Large-Scale Retrieval for Reinforcement Learning0
Challenges and Opportunities in Offline Reinforcement Learning from Visual ObservationsCode2
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement LearningCode1
Show:102550
← PrevPage 11 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified