| CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning | Apr 18, 2022 | ChatbotOffline RL | CodeCode Available | 2 |
| When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning? | Apr 12, 2022 | Atari GamesDiagnostic | —Unverified | 0 |
| Settling the Sample Complexity of Model-Based Offline Reinforcement Learning | Apr 11, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Offline Reinforcement Learning for Safer Blood Glucose Control in People with Type 1 Diabetes | Apr 7, 2022 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| CIRS: Bursting Filter Bubbles by Counterfactual Interactive Recommender System | Apr 4, 2022 | Causal Inferencecounterfactual | CodeCode Available | 1 |
| Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps | Mar 25, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies | Mar 25, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Bellman Residual Orthogonalization for Offline Reinforcement Learning | Mar 24, 2022 | Offline RLOff-policy evaluation | —Unverified | 0 |
| Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning | Mar 21, 2022 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Semi-Markov Offline Reinforcement Learning for Healthcare | Mar 17, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |