| Contrastive Value Learning: Implicit Models for Simple Offline RL | Nov 3, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Corruption-Robust Offline Reinforcement Learning | Jun 11, 2021 | Adversarial RobustnessOffline RL | —Unverified | 0 |
| CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 |
| CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning | Mar 29, 2024 | counterfactualOffline RL | —Unverified | 0 |
| CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning | Dec 19, 2023 | NavigateOffline RL | —Unverified | 0 |
| Curriculum Offline Imitating Learning | Dec 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning | Aug 15, 2024 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | Mar 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Data Center Cooling System Optimization Using Offline Reinforcement Learning | Jan 25, 2025 | Graph Neural NetworkOffline RL | —Unverified | 0 |
| Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data | Oct 16, 2022 | Model SelectionOffline RL | —Unverified | 0 |
| Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning | Sep 29, 2021 | Multi-Task LearningOffline RL | —Unverified | 0 |
| Consistent time travel for realistic interactions with historical data: reinforcement learning for market making | Aug 5, 2024 | Offline RL | —Unverified | 0 |
| Decision SpikeFormer: Spike-Driven Transformer for Decision Making | Apr 4, 2025 | D4RLDecision Making | —Unverified | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision MakingManagement | —Unverified | 0 |
| Deep RL with Hierarchical Action Exploration for Dialogue Generation | Mar 22, 2023 | Dialogue GenerationOffline RL | —Unverified | 0 |
| DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | Feb 23, 2021 | Continuous ControlOffline RL | —Unverified | 0 |
| Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding | Jun 1, 2023 | ManagementOffline RL | —Unverified | 0 |
| Deploying Offline Reinforcement Learning with Human Feedback | Mar 13, 2023 | Decision MakingModel Selection | —Unverified | 0 |
| Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization | Jun 26, 2023 | Offline RLTest-time Adaptation | —Unverified | 0 |
| Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm | Sep 24, 2024 | Offline RLOff-policy evaluation | —Unverified | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue EvaluationOffline RL | —Unverified | 0 |
| DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Oct 15, 2024 | Decision MakingOffline RL | —Unverified | 0 |
| DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 |
| DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Feb 4, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Diffused Task-Agnostic Milestone Planner | Dec 6, 2023 | Decision MakingOffline RL | —Unverified | 0 |