SOTAVerified

Offline RL

Papers

Showing 501–550 of 755 papers

Title | Status | Hype
Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills | | 0
Boosting Offline Reinforcement Learning via Data Rebalancing | | 0
Boosting Offline Reinforcement Learning with Residual Generative Modeling | | 0
Bootstrapped Transformer for Offline Reinforcement Learning | | 0
BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning | | 0
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism | | 0
Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies | | 0
Budgeting Counterfactual for Offline RL | | 0
Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | | 0
Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning | | 0
Can Offline Reinforcement Learning Help Natural Language Understanding? | | 0
Causal prompting model-based offline reinforcement learning | | 0
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | | 0
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | | 0
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer | | 0
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning | | 0
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | | 0
Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | | 0
Confidence-Conditioned Value Functions for Offline Reinforcement Learning | | 0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | | 0
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | | 0
Context-Former: Stitching via Latent Conditioned Sequence Modeling | | 0
Contextual Transformer for Offline Meta Reinforcement Learning | | 0
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | | 0
Contrastive Learning as Goal-Conditioned Reinforcement Learning | | 0
Contrastive Value Learning: Implicit Models for Simple Offline RL | | 0
Corruption-Robust Offline Reinforcement Learning | | 0
CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | | 0
CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning | | 0
CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning | | 0
Curriculum Offline Imitating Learning | | 0
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning | | 0
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | | 0
Data Center Cooling System Optimization Using Offline Reinforcement Learning | | 0
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data | | 0
Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning | | 0
Consistent time travel for realistic interactions with historical data: reinforcement learning for market making | | 0
Decision SpikeFormer: Spike-Driven Transformer for Decision Making | | 0
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | | 0
Deep RL with Hierarchical Action Exploration for Dialogue Generation | | 0
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | | 0
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding | | 0
Deploying Offline Reinforcement Learning with Human Feedback | | 0
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization | | 0
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm | | 0
Dialogue Evaluation with Offline Reinforcement Learning | | 0
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | | 0
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | | 0
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | | 0
Diffused Task-Agnostic Milestone Planner | | 0
Page 11 of 16

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified