| Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | Oct 15, 2024 | ARC, Decision Making | —Unverified | 0 | 0 |
| Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | Jul 29, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning | Jul 10, 2023 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Diffusion Self-Weighted Guidance for Offline Reinforcement Learning | May 23, 2025 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning | Jun 10, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity | Aug 11, 2022 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation | Sep 14, 2022 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches | Feb 13, 2025 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Domain Adaptation for Offline Reinforcement Learning with Limited Samples | Aug 22, 2024 | Domain Adaptation, Offline RL | —Unverified | 0 | 0 |
| Domain Generalization for Robust Model-Based Offline Reinforcement Learning | Nov 27, 2022 | Domain Generalization, Offline RL | —Unverified | 0 | 0 |
| DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning | Sep 16, 2023 | D4RL, model | —Unverified | 0 | 0 |
| Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage | May 16, 2023 | Offline RL | —Unverified | 0 | 0 |
| DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | Dec 9, 2021 | Atari Games, D4RL | —Unverified | 0 | 0 |
| DRDT3: Diffusion-Refined Decision Test-Time Training Model | Jan 12, 2025 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Dual Generator Offline Reinforcement Learning | Nov 2, 2022 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Efficient Imitation Learning with Conservative World Models | May 21, 2024 | Imitation Learning, Offline RL | —Unverified | 0 | 0 |
| Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only | May 22, 2025 | Imitation Learning, Offline RL | —Unverified | 0 | 0 |
| EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Jul 21, 2020 | D4RL, Decision Making | —Unverified | 0 | 0 |
| Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Apr 15, 2024 | GPU, Offline RL | —Unverified | 0 | 0 |
| Enabling A Network AI Gym for Autonomous Cyber Agents | Apr 3, 2023 | Deep Reinforcement Learning, Offline RL | —Unverified | 0 | 0 |
| End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient | Dec 7, 2017 | Decoder, Goal-Oriented Dialog | —Unverified | 0 | 0 |
| End-to-end Offline Reinforcement Learning for Glycemia Control | Oct 16, 2023 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Energy-Weighted Flow Matching for Offline Reinforcement Learning | Mar 6, 2025 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Enhanced DACER Algorithm with High Diffusion Efficiency | May 29, 2025 | Denoising, Imitation Learning | —Unverified | 0 | 0 |
| Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention | Sep 11, 2024 | Offline RL | —Unverified | 0 | 0 |
| Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective | Feb 17, 2025 | Bayesian Optimization, model | —Unverified | 0 | 0 |
| Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits | Feb 7, 2025 | Informativeness, Offline RL | —Unverified | 0 | 0 |
| Enhancing Reinforcement Learning Through Guided Search | Aug 19, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles | Jun 12, 2023 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning | May 12, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning | Mar 7, 2023 | Continuous Control, Offline RL | —Unverified | 0 | 0 |
| Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning | Sep 14, 2023 | Data Augmentation, Offline RL | —Unverified | 0 | 0 |
| Equivariant Offline Reinforcement Learning | Jun 20, 2024 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| Evaluation of Active Feature Acquisition Methods for Static Feature Settings | Dec 6, 2023 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Evaluation-Time Policy Switching for Offline Reinforcement Learning | Mar 15, 2025 | Behavioural cloning, Offline RL | —Unverified | 0 | 0 |
| Exclusively Penalized Q-learning for Offline Reinforcement Learning | May 23, 2024 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations | Aug 7, 2023 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study | May 4, 2025 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCo, Offline RL | —Unverified | 0 | 0 |
| Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL | May 13, 2025 | Offline RL, Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Federated Offline Reinforcement Learning | Jun 11, 2022 | Offline RL, Privacy Preserving | —Unverified | 0 | 0 |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated Learning, Offline RL | —Unverified | 0 | 0 |
| Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching | Jun 24, 2023 | Imitation Learning, Offline RL | —Unverified | 0 | 0 |
| Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Dec 5, 2024 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions | Mar 30, 2023 | Diversity, Offline RL | —Unverified | 0 | 0 |
| Finetuning Offline World Models in the Real World | Oct 24, 2023 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback | Jan 27, 2025 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning | Jun 26, 2025 | Action Generation, Decision Making | —Unverified | 0 | 0 |
| FOSP: Fine-tuning Offline Safe Policy through World Models | Jul 6, 2024 | Model-based Reinforcement Learning, Offline RL | —Unverified | 0 | 0 |
| From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning | Jul 17, 2025 | D4RL, Offline RL | —Unverified | 0 | 0 |