SOTAVerified

D4RL

Papers

Showing 101-150 of 226 papers

| Title | Status | Hype |
|---|---|---|
| DiffuserLite: Towards Real-time Diffusion Planning | | 0 |
| Solving Offline Reinforcement Learning with Decision Tree Regression | Code | 0 |
| Exploration and Anti-Exploration with Distributional Random Network Distillation | Code | 1 |
| Learning from Sparse Offline Datasets via Conservative Density Estimation | Code | 0 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Code | 1 |
| Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning | Code | 1 |
| Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning | | 0 |
| CROP: Conservative Reward for Model-based Offline Policy Optimization | Code | 1 |
| Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias | Code | 1 |
| Score Regularized Policy Optimization through Diffusion Behavior | Code | 1 |
| DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning | Code | 0 |
| Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning | | 0 |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Code | 1 |
| Pre-training with Synthetic Data Helps Offline Reinforcement Learning | Code | 0 |
| DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning | | 0 |
| Reasoning with Latent Diffusion in Offline Reinforcement Learning | Code | 1 |
| Multi-Objective Decision Transformers for Offline Reinforcement Learning | | 0 |
| Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning | | 0 |
| Learning Computational Efficient Bots with Costly Features | | 0 |
| Offline Reinforcement Learning with On-Policy Q-Function Regularization | | 0 |
| Model-based Offline Reinforcement Learning with Count-based Conservatism | Code | 0 |
| Offline Diversity Maximization Under Imitation Constraints | | 0 |
| Budgeting Counterfactual for Offline RL | | 0 |
| Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning | | 0 |
| Offline Reinforcement Learning with Imbalanced Datasets | | 0 |
| Elastic Decision Transformer | | 0 |
| Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning | Code | 1 |
| Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning | | 0 |
| CEIL: Generalized Contextual Imitation Learning | | 0 |
| Datasets and Benchmarks for Offline Safe Reinforcement Learning | Code | 2 |
| Katakomba: Tools and Benchmarks for Data-Driven NetHack | Code | 1 |
| Curricular Subgoals for Inverse Reinforcement Learning | Code | 1 |
| A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning | | 0 |
| HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach | | 0 |
| Iteratively Refined Behavior Regularization for Offline Reinforcement Learning | | 0 |
| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Code | 0 |
| Boosting Offline Reinforcement Learning with Action Preference Query | | 0 |
| Improving Offline RL by Blending Heuristics | | 0 |
| IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control | | 0 |
| Improving and Benchmarking Offline Reinforcement Learning Algorithms | Code | 1 |
| Efficient Diffusion Policies for Offline Reinforcement Learning | Code | 1 |
| Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation | Code | 1 |
| Emergent Agentic Transformer from Chain of Hindsight Experience | | 0 |
| When should we prefer Decision Transformers for Offline Reinforcement Learning? | Code | 1 |
| Revisiting the Minimalist Approach to Offline Reinforcement Learning | Code | 1 |
| Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning | Code | 1 |
| Uncertainty-driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning | Code | 0 |
| Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization | Code | 1 |
| Optimal Transport for Offline Imitation Learning | Code | 1 |
| Behavior Proximal Policy Optimization | Code | 1 |
Page 3 of 5

No leaderboard results yet.