| A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies | Mar 25, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Near-Optimal Offline Reinforcement Learning via Double Variance Reduction | Feb 2, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning | Jul 7, 2020 | Offline RLreinforcement-learning | —Unverified | 0 |
| Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | Jul 29, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning | May 29, 2025 | DenoisingMuJoCo | —Unverified | 0 |
| How to Provably Improve Return Conditioned Supervised Learning? | Jun 10, 2025 | Decision MakingOffline RL | —Unverified | 0 |
| Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | Oct 15, 2024 | ARCDecision Making | —Unverified | 0 |
| Diffused Task-Agnostic Milestone Planner | Dec 6, 2023 | Decision MakingOffline RL | —Unverified | 0 |
| A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs | Feb 7, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Multi-Object Navigation in real environments using hybrid policies | Jan 24, 2024 | Imitation LearningObject | —Unverified | 0 |
| DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Feb 4, 2024 | D4RLData Augmentation | —Unverified | 0 |
| BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning | Jan 1, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Multi-Objective Decision Transformers for Offline Reinforcement Learning | Aug 31, 2023 | D4RLOffline RL | —Unverified | 0 |
| Leveraging Offline Data in Online Reinforcement Learning | Nov 9, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains | Apr 11, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Bootstrapped Transformer for Offline Reinforcement Learning | Jun 17, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning | Oct 15, 2024 | Collision AvoidanceOffline RL | —Unverified | 0 |
| Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning | Nov 12, 2024 | Imitation LearningOffline RL | —Unverified | 0 |
| Addressing Extrapolation Error in Deep Offline Reinforcement Learning | Jan 1, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Learning to View: Decision Transformers for Active Object Detection | Jan 23, 2023 | Active Object DetectionMotion Planning | —Unverified | 0 |
| DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Oct 15, 2024 | Decision MakingOffline RL | —Unverified | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue EvaluationOffline RL | —Unverified | 0 |
| Learning to Influence Human Behavior with Offline Reinforcement Learning | Mar 3, 2023 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm | Sep 24, 2024 | Offline RLOff-policy evaluation | —Unverified | 0 |
| Boosting Offline Reinforcement Learning via Data Rebalancing | Oct 17, 2022 | D4RLOffline RL | —Unverified | 0 |
| More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning | Feb 11, 2024 | Distributional Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 |
| MOReL: Model-Based Offline Reinforcement Learning | Dec 1, 2020 | modelOffline RL | —Unverified | 0 |
| Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning | Jun 8, 2025 | Offline RLQuestion Answering | —Unverified | 0 |
| Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | Sep 29, 2021 | Offline RLRecommendation Systems | —Unverified | 0 |
| Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation | Jun 9, 2025 | Decision MakingMuJoCo | —Unverified | 0 |
| Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization | Jun 26, 2023 | Offline RLTest-time Adaptation | —Unverified | 0 |
| Learning Value Functions from Undirected State-only Experience | Apr 26, 2022 | Future predictionImitation Learning | —Unverified | 0 |
| Boosting Offline Reinforcement Learning with Residual Generative Modeling | Jun 19, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RLQ-Learning | —Unverified | 0 |
| Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills | Sep 24, 2023 | Autonomous DrivingOffline RL | —Unverified | 0 |
| MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning | Jan 21, 2024 | Decision MakingOffline RL | —Unverified | 0 |
| Launchpad: Learning to Schedule Using Offline and Online RL Methods | Dec 1, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Deploying Offline Reinforcement Learning with Human Feedback | Mar 13, 2023 | Decision MakingModel Selection | —Unverified | 0 |
| Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments | Oct 13, 2023 | Active LearningOffline RL | —Unverified | 0 |
| DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 |
| LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble | Nov 26, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning | Jul 5, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| Language Decision Transformers with Exponential Tilt for Interactive Text Environments | Feb 10, 2023 | Offline RL | —Unverified | 0 |
| Large-Scale Retrieval for Reinforcement Learning | Jun 10, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding | Jun 1, 2023 | ManagementOffline RL | —Unverified | 0 |
| Bi-Level Offline Policy Optimization with Limited Exploration | Oct 10, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | Jun 11, 2025 | D4RLDeep Reinforcement Learning | —Unverified | 0 |
| MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning | Jan 6, 2024 | Offline RLRobot Manipulation | —Unverified | 0 |
| Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game | May 31, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Large Language Model driven Policy Exploration for Recommender Systems | Jan 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |