| Can Offline Reinforcement Learning Help Natural Language Understanding? | Sep 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Causal prompting model-based offline reinforcement learning | Jun 3, 2024 | modelOffline RL | —Unverified | 0 | 0 |
| CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Jun 11, 2024 | D4RLDenoising | —Unverified | 0 | 0 |
| Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | May 13, 2021 | Offline RL | —Unverified | 0 | 0 |
| ChiPFormer: Transferable Chip Placement via Offline Decision Transformer | Jun 26, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning | Jun 23, 2023 | Imitation LearningOffline RL | —Unverified | 0 | 0 |
| ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | Oct 2, 2024 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | Jan 14, 2022 | modelMuJoCo | —Unverified | 0 | 0 |
| Confidence-Conditioned Value Functions for Offline Reinforcement Learning | Dec 8, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | Sep 16, 2021 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |