| Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning | Feb 22, 2021 | Autonomous Drivingcontinuous-control | —Unverified | 0 | 0 |
| Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly | Apr 26, 2024 | Contact-rich ManipulationOffline RL | —Unverified | 0 | 0 |
| Generative Probabilistic Planning for Optimizing Supply Chain Networks | Apr 11, 2024 | Deep Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning | May 24, 2025 | GPUOffline RL | —Unverified | 0 | 0 |
| Goal-Conditioned Data Augmentation for Offline Reinforcement Learning | Dec 29, 2024 | D4RLData Augmentation | —Unverified | 0 | 0 |
| Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via Metric Learning | Feb 16, 2024 | Metric LearningOffline RL | —Unverified | 0 | 0 |
| Goal-Conditioned Predictive Coding for Offline Reinforcement Learning | Jul 7, 2023 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Graph Decision Transformer | Mar 7, 2023 | Offline RLOpenAI Gym | —Unverified | 0 | 0 |
| GriddlyJS: A Web IDE for Reinforcement Learning | Jul 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning | Oct 27, 2023 | Autonomous DrivingD4RL | —Unverified | 0 | 0 |
| H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps | Sep 22, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Harnessing Density Ratios for Online Reinforcement Learning | Jan 18, 2024 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| H-GAP: Humanoid Control with a Generalist Planner | Dec 5, 2023 | Humanoid ControlModel Predictive Control | —Unverified | 0 | 0 |
| How to Leverage Unlabeled Data in Offline Reinforcement Learning | Feb 3, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation | May 6, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Human-centric Dialog Training via Offline Reinforcement Learning | Oct 12, 2020 | Language ModellingOffline RL | —Unverified | 0 | 0 |
| Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance | Sep 4, 2023 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs | Aug 8, 2024 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Hyperparameter Selection for Offline Reinforcement Learning | Jul 17, 2020 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Implicit Offline Reinforcement Learning via Supervised Learning | Oct 21, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning | Dec 31, 2021 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Dec 3, 2024 | ObjectOffline RL | —Unverified | 0 | 0 |
| Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning | Jun 1, 2021 | Offline RLRecommendation Systems | —Unverified | 0 | 0 |
| Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization | Dec 24, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Improving Offline Reinforcement Learning with Inaccurate Simulators | May 7, 2024 | D4RLGenerative Adversarial Network | —Unverified | 0 | 0 |
| Improving Offline RL by Blending Heuristics | Jun 1, 2023 | D4RLOffline RL | —Unverified | 0 | 0 |
| Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions | Nov 29, 2021 | Contrastive LearningDecision Making | —Unverified | 0 | 0 |
| InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem | May 2, 2021 | Atari GamesOffline RL | —Unverified | 0 | 0 |
| Iteratively Refined Behavior Regularization for Offline Reinforcement Learning | Jun 9, 2023 | D4RLOffline RL | —Unverified | 0 | 0 |
| Instabilities of Offline RL with Pre-Trained Neural Representation | Mar 8, 2021 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning | Jun 8, 2023 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning | Feb 19, 2021 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Integrating Domain Knowledge for handling Limited Data in Offline RL | Jun 11, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba | Aug 20, 2024 | MambaOffline RL | —Unverified | 0 | 0 |
| Integrating Offline Reinforcement Learning with Transformers for Sequential Recommendation | Jul 26, 2023 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Integrating Reinforcement Learning and Large Language Models for Crop Production Process Management Optimization and Control through A New Knowledge-Based Deep Learning Paradigm | Oct 13, 2024 | ManagementOffline RL | —Unverified | 0 | 0 |
| IntelliLung: Advancing Safe Mechanical Ventilation using Offline RL with Hybrid Actions and Clinically Aligned Rewards | Jun 17, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Interpretable performance analysis towards offline reinforcement learning: A dataset perspective | May 12, 2021 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory | May 29, 2024 | Imitation LearningOffline RL | —Unverified | 0 | 0 |
| IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control | Jun 1, 2023 | D4RLModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Is Conditional Generative Modeling all you need for Decision-Making? | Nov 28, 2022 | AllDecision Making | —Unverified | 0 | 0 |
| Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective | Nov 29, 2023 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Is Pessimism Provably Efficient for Offline RL? | Dec 30, 2020 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| KAN v.s. MLP for Offline Reinforcement Learning | Sep 15, 2024 | D4RLKolmogorov-Arnold Networks | —Unverified | 0 | 0 |
| Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL | Jun 1, 2022 | D4RLOffline RL | —Unverified | 0 | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Nov 2, 2021 | D4RLData Augmentation | —Unverified | 0 | 0 |
| Language-Conditioned Offline RL for Multi-Robot Navigation | Jul 29, 2024 | Offline RLRobot Navigation | —Unverified | 0 | 0 |
| Large Language Model driven Policy Exploration for Recommender Systems | Jan 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Large-Scale Retrieval for Reinforcement Learning | Jun 10, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |