SOTAVerified

Offline RL

Papers

Showing 376400 of 755 papers

TitleStatusHype
Contrastive Value Learning: Implicit Models for Simple Offline RL0
Corruption-Robust Offline Reinforcement Learning0
CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games0
CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning0
CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning0
Curriculum Offline Imitating Learning0
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning0
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning0
Data Center Cooling System Optimization Using Offline Reinforcement Learning0
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data0
Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning0
Consistent time travel for realistic interactions with historical data: reinforcement learning for market making0
Decision SpikeFormer: Spike-Driven Transformer for Decision Making0
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications0
Deep RL with Hierarchical Action Exploration for Dialogue Generation0
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning0
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding0
Deploying Offline Reinforcement Learning with Human Feedback0
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization0
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm0
Dialogue Evaluation with Offline Reinforcement Learning0
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation0
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning0
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching0
Diffused Task-Agnostic Milestone Planner0
Show:102550
← PrevPage 16 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified