| Launchpad: Learning to Schedule Using Offline and Online RL Methods | Dec 1, 2022 | Deep Reinforcement Learning, Offline RL | —Unverified | 0 | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | Sep 29, 2021 | Offline RL, Recommendation Systems | —Unverified | 0 | 0 |
| Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning | Jun 8, 2025 | Offline RL, Question Answering | —Unverified | 0 | 0 |
| Learning to Influence Human Behavior with Offline Reinforcement Learning | Mar 3, 2023 | Autonomous Driving, Offline RL | —Unverified | 0 | 0 |
| Learning to View: Decision Transformers for Active Object Detection | Jan 23, 2023 | Active Object Detection, Motion Planning | —Unverified | 0 | 0 |
| Learning Value Functions from Undirected State-only Experience | Apr 26, 2022 | Future prediction, Imitation Learning | —Unverified | 0 | 0 |
| Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains | Apr 11, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Leveraging Offline Data in Online Reinforcement Learning | Nov 9, 2022 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments | Oct 13, 2023 | Active Learning, Offline RL | —Unverified | 0 | 0 |
| LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble | Nov 26, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning | Jul 5, 2023 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| Language Decision Transformers with Exponential Tilt for Interactive Text Environments | Feb 10, 2023 | Offline RL | —Unverified | 0 | 0 |
| Measurement Scheduling for ICU Patients with Offline Reinforcement Learning | Feb 12, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning | Mar 14, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning | Apr 14, 2023 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization | Dec 7, 2023 | Model-based Reinforcement Learning, Offline RL | —Unverified | 0 | 0 |
| Model-Based Offline Planning | Aug 12, 2020 | model, Offline RL | —Unverified | 0 | 0 |
| Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation | Mar 26, 2025 | D4RL, Data Augmentation | —Unverified | 0 | 0 |
| Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds | Aug 16, 2024 | Model-based Reinforcement Learning, Offline RL | —Unverified | 0 | 0 |
| Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation | Oct 25, 2023 | Contrastive Learning, model | —Unverified | 0 | 0 |
| Model Generation with Provable Coverability for Offline Reinforcement Learning | Jun 1, 2022 | Offline RL, Out-of-Distribution Generalization | —Unverified | 0 | 0 |
| MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning | Jan 21, 2024 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | Jun 11, 2025 | D4RL, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning | Feb 11, 2024 | Distributional Reinforcement Learning, Multi-Armed Bandits | —Unverified | 0 | 0 |