SOTAVerified

Offline RL

Papers

Showing 376400 of 755 papers

TitleStatusHype
An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems0
Data-Incremental Continual Offline Reinforcement Learning0
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning AgentsCode0
Offline Trajectory Generalization for Offline Reinforcement Learning0
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL0
Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains0
Generative Probabilistic Planning for Optimizing Supply Chain Networks0
Compositional Conservatism: A Transductive Approach in Offline Reinforcement LearningCode0
CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning0
Scaling Vision-and-Language Navigation With Offline RL0
Uncertainty-aware Distributional Offline Reinforcement Learning0
Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling0
The Value of Reward Lookahead in Reinforcement Learning0
Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning0
Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning0
Why Online Reinforcement Learning is Causal0
Offline Fictitious Self-Play for Competitive Games0
Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding0
Align Your Intents: Offline Imitation Learning via Optimal Transport0
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic SpacesCode0
Offline Multi-task Transfer RL with Representational Penalization0
Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via Metric Learning0
Universal Black-Box Reward Poisoning Attack against Offline Reinforcement Learning0
Measurement Scheduling for ICU Patients with Offline Reinforcement Learning0
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning0
Show:102550
← PrevPage 16 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified