| DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning | Sep 15, 2021 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Jan 24, 2025 | MuJoCoOffline RL | CodeCode Available | 0 |
| Explaining RL Decisions with Trajectories | May 6, 2023 | Attributecontinuous-control | CodeCode Available | 0 |
| Experimental evaluation of offline reinforcement learning for HVAC control in buildings | Aug 15, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Offline Reinforcement Learning from Datasets with Structured Non-Stationarity | May 23, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Policy-regularized Offline Multi-objective Reinforcement Learning | Jan 4, 2024 | Multi-Objective Reinforcement LearningOffline RL | CodeCode Available | 0 |
| POPO: Pessimistic Offline Policy Optimization | Dec 26, 2020 | Offline RLQ-Learning | CodeCode Available | 0 |
| d3rlpy: An Offline Deep Reinforcement Learning Library | Nov 6, 2021 | D4RLDeep Reinforcement Learning | CodeCode Available | 0 |
| Preference-Guided Reflective Sampling for Aligning Language Models | Aug 22, 2024 | Document SummarizationInstruction Following | CodeCode Available | 0 |
| MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning | Jun 10, 2025 | Data Augmentationmodel | CodeCode Available | 0 |