| Learning to Influence Human Behavior with Offline Reinforcement Learning | Mar 3, 2023 | Autonomous Driving, Offline RL | —Unverified | 0 |
| Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm | Sep 24, 2024 | Offline RL, Off-policy evaluation | —Unverified | 0 |
| Boosting Offline Reinforcement Learning via Data Rebalancing | Oct 17, 2022 | D4RL, Offline RL | —Unverified | 0 |
| Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning | Jun 8, 2025 | Offline RL, Question Answering | —Unverified | 0 |
| Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | Sep 29, 2021 | Offline RL, Recommendation Systems | —Unverified | 0 |
| Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization | Jun 26, 2023 | Offline RL, Test-time Adaptation | —Unverified | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RL, Q-Learning | —Unverified | 0 |
| Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills | Sep 24, 2023 | Autonomous Driving, Offline RL | —Unverified | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue Evaluation, Offline RL | —Unverified | 0 |
| Launchpad: Learning to Schedule Using Offline and Online RL Methods | Dec 1, 2022 | Deep Reinforcement Learning, Offline RL | —Unverified | 0 |