| Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression | Oct 25, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance | Oct 17, 2024 | Offline RLRe-Ranking | CodeCode Available | 1 |
| Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining | Oct 1, 2024 | Atari Gamesmodel | CodeCode Available | 1 |
| DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors | Sep 26, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer | Jun 10, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Strategically Conservative Q-Learning | Jun 6, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| Diffusion Policies creating a Trust Region for Offline Reinforcement Learning | May 30, 2024 | D4RLDenoising | CodeCode Available | 1 |
| Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination | May 28, 2024 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL | May 28, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Q-value Regularized Transformer for Offline Reinforcement Learning | May 27, 2024 | D4RLOffline RL | CodeCode Available | 1 |