| VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning | Apr 16, 2025 | D4RL, Offline RL | Unverified | 0 |
| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | Continuous Control | Unverified | 0 |
| Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters | May 27, 2022 | D4RL, Offline RL | Unverified | 0 |
| Offline Trajectory Generalization for Offline Reinforcement Learning | Apr 16, 2024 | D4RL, Data Augmentation | Unverified | 0 |
| On the Role of Discount Factor in Offline Reinforcement Learning | Jun 7, 2022 | D4RL, Offline RL | Unverified | 0 |
| Binary Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning | Jun 14, 2024 | D4RL, Offline RL | Unverified | 0 |
| Pareto Policy Pool for Model-based Offline Reinforcement Learning | Sep 29, 2021 | D4RL, Offline RL | Unverified | 0 |
| Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens | Sep 14, 2024 | D4RL, Reinforcement Learning | Unverified | 0 |
| Offline Behavior Distillation | Oct 30, 2024 | D4RL, Reinforcement Learning (RL) | Code Available | 0 |
| NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation | Oct 30, 2024 | D4RL, Management | Code Available | 0 |