| MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Mar 30, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions | Mar 30, 2023 | DiversityOffline RL | —Unverified | 0 |
| Deep RL with Hierarchical Action Exploration for Dialogue Generation | Mar 22, 2023 | Dialogue GenerationOffline RL | —Unverified | 0 |
| Adaptive Policy Learning for Offline-to-Online Reinforcement Learning | Mar 14, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Deploying Offline Reinforcement Learning with Human Feedback | Mar 13, 2023 | Decision MakingModel Selection | —Unverified | 0 |
| Graph Decision Transformer | Mar 7, 2023 | Offline RLOpenAI Gym | —Unverified | 0 |
| Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning | Mar 7, 2023 | Continuous ControlOffline RL | —Unverified | 0 |
| On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples | Mar 7, 2023 | Offline RLOff-policy evaluation | —Unverified | 0 |
| Learning to Influence Human Behavior with Offline Reinforcement Learning | Mar 3, 2023 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Decision Transformer under Random Frame Dropping | Mar 3, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |