SOTAVerified

Offline RL

Papers

Showing 201-225 of 755 papers

Title | Status | Hype
Exclusively Penalized Q-learning for Offline Reinforcement Learning | - | 0
Offline Reinforcement Learning from Datasets with Structured Non-Stationarity | Code | 0
Offline RL via Feature-Occupancy Gradient Ascent | - | 0
Efficient Imitation Learning with Conservative World Models | - | 0
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | Code | 0
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | - | 0
Reinformer: Max-Return Sequence Modeling for Offline RL | Code | 1
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning | - | 0
Improving Offline Reinforcement Learning with Inaccurate Simulators | - | 0
LTLDoG: Satisfying Temporally-Extended Symbolic Constraints for Safe Diffusion-based Planning | Code | 1
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows | - | 0
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning | - | 0
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning | Code | 1
Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly | - | 0
Offline Reinforcement Learning with Behavioral Supervisor Tuning | - | 0
An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems | - | 0
Data-Incremental Continual Offline Reinforcement Learning | - | 0
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents | Code | 0
Offline Trajectory Generalization for Offline Reinforcement Learning | - | 0
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | - | 0
Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains | - | 0
Generative Probabilistic Planning for Optimizing Supply Chain Networks | - | 0
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning | Code | 0
CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning | - | 0
Scaling Vision-and-Language Navigation With Offline RL | - | 0
Page 9 of 31

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | - | Unverified
2 | ADMPO | Average Reward | 81 | - | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | - | Unverified