| Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Feb 11, 2025 | Offline RLreinforcement-learning | CodeCode Available | 0 | 5 |
| On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Mar 3, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 | 5 |
| A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Mar 2, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 | 5 |
| Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood | Jun 10, 2025 | Computational EfficiencyD4RL | CodeCode Available | 0 | 5 |
| Off-policy Evaluation in Doubly Inhomogeneous Environments | Jun 14, 2023 | Offline RLOff-policy evaluation | CodeCode Available | 0 | 5 |
| DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty | Jun 14, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning | Jun 10, 2025 | Data Augmentationmodel | CodeCode Available | 0 | 5 |
| Offline RL With Resource Constrained Online Deployment | Oct 7, 2021 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization | Jun 18, 2025 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination | Jun 16, 2022 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| Offline Equilibrium Finding | Jul 12, 2022 | Offline RL | CodeCode Available | 0 | 5 |
| Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees | Nov 14, 2023 | Offline RL | CodeCode Available | 0 | 5 |
| Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Jan 24, 2025 | MuJoCoOffline RL | CodeCode Available | 0 | 5 |
| Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning | Oct 16, 2023 | ChatbotOffline RL | CodeCode Available | 0 | 5 |
| NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation | Oct 30, 2024 | D4RLManagement | CodeCode Available | 0 | 5 |
| Multi-Game Decision Transformers | May 30, 2022 | Atari GamesOffline RL | CodeCode Available | 0 | 5 |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Jul 23, 2024 | D4RLDecision Making | CodeCode Available | 0 | 5 |
| Continual Task Learning through Adaptive Policy Self-Composition | Nov 18, 2024 | Continual LearningOffline RL | CodeCode Available | 0 | 5 |
| Mutual Information Regularized Offline Reinforcement Learning | Oct 14, 2022 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning | Jun 24, 2020 | Atari GamesDQN Replay Dataset | CodeCode Available | 0 | 5 |
| Offline Reinforcement Learning from Datasets with Structured Non-Stationarity | May 23, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage | Oct 27, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Oct 2, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 | 5 |
| Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Mar 5, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 | 5 |
| Model-based Offline Policy Optimization with Adversarial Network | Sep 5, 2023 | modelOffline RL | CodeCode Available | 0 | 5 |