| Enhancing Reinforcement Learning Through Guided Search | Aug 19, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles | Jun 12, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning | May 12, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning | Mar 7, 2023 | Continuous ControlOffline RL | —Unverified | 0 |
| Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning | Sep 14, 2023 | Data AugmentationOffline RL | —Unverified | 0 |
| Equivariant Offline Reinforcement Learning | Jun 20, 2024 | Offline RLQ-Learning | —Unverified | 0 |
| Evaluation of Active Feature Acquisition Methods for Static Feature Settings | Dec 6, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Evaluation-Time Policy Switching for Offline Reinforcement Learning | Mar 15, 2025 | Behavioural cloningOffline RL | —Unverified | 0 |
| Exclusively Penalized Q-learning for Offline Reinforcement Learning | May 23, 2024 | Offline RLQ-Learning | —Unverified | 0 |
| Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations | Aug 7, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study | May 4, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCoOffline RL | —Unverified | 0 |
| Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL | May 13, 2025 | Offline RLSafe Reinforcement Learning | —Unverified | 0 |
| Federated Offline Reinforcement Learning | Jun 11, 2022 | Offline RLPrivacy Preserving | —Unverified | 0 |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated LearningOffline RL | —Unverified | 0 |
| Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching | Jun 24, 2023 | Imitation LearningOffline RL | —Unverified | 0 |
| Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Dec 5, 2024 | D4RLOffline RL | —Unverified | 0 |
| Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions | Mar 30, 2023 | DiversityOffline RL | —Unverified | 0 |
| Finetuning Offline World Models in the Real World | Oct 24, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback | Jan 27, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning | Jun 26, 2025 | Action GenerationDecision Making | —Unverified | 0 |
| FOSP: Fine-tuning Offline Safe Policy through World Models | Jul 6, 2024 | Model-based Reinforcement LearningOffline RL | —Unverified | 0 |
| From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning | Jul 17, 2025 | D4RLOffline RL | —Unverified | 0 |
| Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning | Feb 22, 2021 | Autonomous Drivingcontinuous-control | —Unverified | 0 |
| Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly | Apr 26, 2024 | Contact-rich ManipulationOffline RL | —Unverified | 0 |
| Generative Probabilistic Planning for Optimizing Supply Chain Networks | Apr 11, 2024 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning | May 24, 2025 | GPUOffline RL | —Unverified | 0 |
| Goal-Conditioned Data Augmentation for Offline Reinforcement Learning | Dec 29, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via Metric Learning | Feb 16, 2024 | Metric LearningOffline RL | —Unverified | 0 |
| Goal-Conditioned Predictive Coding for Offline Reinforcement Learning | Jul 7, 2023 | Decision MakingOffline RL | —Unverified | 0 |
| Graph Decision Transformer | Mar 7, 2023 | Offline RLOpenAI Gym | —Unverified | 0 |
| GriddlyJS: A Web IDE for Reinforcement Learning | Jul 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning | Oct 27, 2023 | Autonomous DrivingD4RL | —Unverified | 0 |
| H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps | Sep 22, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Harnessing Density Ratios for Online Reinforcement Learning | Jan 18, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| H-GAP: Humanoid Control with a Generalist Planner | Dec 5, 2023 | Humanoid ControlModel Predictive Control | —Unverified | 0 |
| How to Leverage Unlabeled Data in Offline Reinforcement Learning | Feb 3, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation | May 6, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance | Sep 4, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs | Aug 8, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Hyperparameter Selection for Offline Reinforcement Learning | Jul 17, 2020 | Offline RLreinforcement-learning | —Unverified | 0 |
| Implicit Offline Reinforcement Learning via Supervised Learning | Oct 21, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning | Dec 31, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Dec 3, 2024 | ObjectOffline RL | —Unverified | 0 |
| Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning | Jun 1, 2021 | Offline RLRecommendation Systems | —Unverified | 0 |
| Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization | Dec 24, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Improving Offline Reinforcement Learning with Inaccurate Simulators | May 7, 2024 | D4RLGenerative Adversarial Network | —Unverified | 0 |
| Improving Offline RL by Blending Heuristics | Jun 1, 2023 | D4RLOffline RL | —Unverified | 0 |
| Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions | Nov 29, 2021 | Contrastive LearningDecision Making | —Unverified | 0 |