| Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills | Sep 24, 2023 | Autonomous DrivingOffline RL | —Unverified | 0 | 0 |
| Boosting Offline Reinforcement Learning via Data Rebalancing | Oct 17, 2022 | D4RLOffline RL | —Unverified | 0 | 0 |
| Boosting Offline Reinforcement Learning with Residual Generative Modeling | Jun 19, 2021 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Bootstrapped Transformer for Offline Reinforcement Learning | Jun 17, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning | Jan 1, 2021 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism | Mar 22, 2021 | Imitation LearningMulti-Armed Bandits | —Unverified | 0 | 0 |
| Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies | Dec 15, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Budgeting Counterfactual for Offline RL | Jul 12, 2023 | counterfactualCounterfactual Reasoning | —Unverified | 0 | 0 |
| Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | May 12, 2025 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning | Jul 8, 2021 | Face DetectionFace Recognition | —Unverified | 0 | 0 |
| Can Offline Reinforcement Learning Help Natural Language Understanding? | Sep 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Causal prompting model-based offline reinforcement learning | Jun 3, 2024 | modelOffline RL | —Unverified | 0 | 0 |
| CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Jun 11, 2024 | D4RLDenoising | —Unverified | 0 | 0 |
| Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | May 13, 2021 | Offline RL | —Unverified | 0 | 0 |
| ChiPFormer: Transferable Chip Placement via Offline Decision Transformer | Jun 26, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning | Jun 23, 2023 | Imitation LearningOffline RL | —Unverified | 0 | 0 |
| ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | Oct 2, 2024 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | Jan 14, 2022 | modelMuJoCo | —Unverified | 0 | 0 |
| Confidence-Conditioned Value Functions for Offline Reinforcement Learning | Dec 8, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | Sep 16, 2021 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | Jul 19, 2021 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Context-Former: Stitching via Latent Conditioned Sequence Modeling | Jan 29, 2024 | D4RLDecision Making | —Unverified | 0 | 0 |
| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RLMeta Reinforcement Learning | —Unverified | 0 | 0 |
| Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | Feb 5, 2024 | Contrastive LearningD4RL | —Unverified | 0 | 0 |
| Contrastive Learning as Goal-Conditioned Reinforcement Learning | Jun 15, 2022 | Contrastive LearningData Augmentation | —Unverified | 0 | 0 |
| Contrastive Value Learning: Implicit Models for Simple Offline RL | Nov 3, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Corruption-Robust Offline Reinforcement Learning | Jun 11, 2021 | Adversarial RobustnessOffline RL | —Unverified | 0 | 0 |
| CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning | Mar 29, 2024 | counterfactualOffline RL | —Unverified | 0 | 0 |
| CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning | Dec 19, 2023 | NavigateOffline RL | —Unverified | 0 | 0 |
| Curriculum Offline Imitating Learning | Dec 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning | Aug 15, 2024 | Deep Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | Mar 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Data Center Cooling System Optimization Using Offline Reinforcement Learning | Jan 25, 2025 | Graph Neural NetworkOffline RL | —Unverified | 0 | 0 |
| Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data | Oct 16, 2022 | Model SelectionOffline RL | —Unverified | 0 | 0 |
| Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning | Sep 29, 2021 | Multi-Task LearningOffline RL | —Unverified | 0 | 0 |
| Consistent time travel for realistic interactions with historical data: reinforcement learning for market making | Aug 5, 2024 | Offline RL | —Unverified | 0 | 0 |
| Decision SpikeFormer: Spike-Driven Transformer for Decision Making | Apr 4, 2025 | D4RLDecision Making | —Unverified | 0 | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision MakingManagement | —Unverified | 0 | 0 |
| Deep RL with Hierarchical Action Exploration for Dialogue Generation | Mar 22, 2023 | Dialogue GenerationOffline RL | —Unverified | 0 | 0 |
| DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | Feb 23, 2021 | Continuous ControlOffline RL | —Unverified | 0 | 0 |
| Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding | Jun 1, 2023 | ManagementOffline RL | —Unverified | 0 | 0 |
| Deploying Offline Reinforcement Learning with Human Feedback | Mar 13, 2023 | Decision MakingModel Selection | —Unverified | 0 | 0 |
| Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization | Jun 26, 2023 | Offline RLTest-time Adaptation | —Unverified | 0 | 0 |
| Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm | Sep 24, 2024 | Offline RLOff-policy evaluation | —Unverified | 0 | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue EvaluationOffline RL | —Unverified | 0 | 0 |
| DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Oct 15, 2024 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Feb 4, 2024 | D4RLData Augmentation | —Unverified | 0 | 0 |
| Diffused Task-Agnostic Milestone Planner | Dec 6, 2023 | Decision MakingOffline RL | —Unverified | 0 | 0 |