SOTAVerified

Offline RL

Papers

Showing 351-400 of 755 papers

Title | Status | Hype
CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | | 0
CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning | | 0
CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning | | 0
Curriculum Offline Imitating Learning | | 0
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning | | 0
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | | 0
Data Center Cooling System Optimization Using Offline Reinforcement Learning | | 0
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data | | 0
Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning | | 0
Consistent time travel for realistic interactions with historical data: reinforcement learning for market making | | 0
Decision SpikeFormer: Spike-Driven Transformer for Decision Making | | 0
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | | 0
Deep RL with Hierarchical Action Exploration for Dialogue Generation | | 0
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | | 0
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding | | 0
Deploying Offline Reinforcement Learning with Human Feedback | | 0
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization | | 0
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm | | 0
Dialogue Evaluation with Offline Reinforcement Learning | | 0
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | | 0
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | | 0
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | | 0
Diffused Task-Agnostic Milestone Planner | | 0
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | | 0
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | | 0
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning | | 0
Diffusion Self-Weighted Guidance for Offline Reinforcement Learning | | 0
Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning | | 0
Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity | | 0
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation | | 0
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches | | 0
Domain Adaptation for Offline Reinforcement Learning with Limited Samples | | 0
Domain Generalization for Robust Model-Based Offline Reinforcement Learning | | 0
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning | | 0
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage | | 0
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | | 0
DRDT3: Diffusion-Refined Decision Test-Time Training Model | | 0
Dual Generator Offline Reinforcement Learning | | 0
Efficient Imitation Learning with Conservative World Models | | 0
Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only | | 0
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | | 0
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | | 0
Enabling A Network AI Gym for Autonomous Cyber Agents | | 0
End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient | | 0
End-to-end Offline Reinforcement Learning for Glycemia Control | | 0
Energy-Weighted Flow Matching for Offline Reinforcement Learning | | 0
Enhanced DACER Algorithm with High Diffusion Efficiency | | 0
Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention | | 0
Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective | | 0
Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified