| MOReL: Model-Based Offline Reinforcement Learning | Dec 1, 2020 | modelOffline RL | —Unverified | 0 | 0 |
| MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning | Jan 6, 2024 | Offline RLRobot Manipulation | —Unverified | 0 | 0 |
| Multi-Objective Decision Transformers for Offline Reinforcement Learning | Aug 31, 2023 | D4RLOffline RL | —Unverified | 0 | 0 |
| Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning | Oct 15, 2024 | Collision AvoidanceOffline RL | —Unverified | 0 | 0 |
| Multi-Object Navigation in real environments using hybrid policies | Jan 24, 2024 | Imitation LearningObject | —Unverified | 0 | 0 |
| Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning | Nov 12, 2024 | Imitation LearningOffline RL | —Unverified | 0 | 0 |
| Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game | May 31, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Near-Optimal Offline Reinforcement Learning via Double Variance Reduction | Feb 2, 2021 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning | Jul 7, 2020 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Neural Network Approximation for Pessimistic Offline Reinforcement Learning | Dec 19, 2023 | Deep Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| Off-dynamics Conditional Diffusion Planners | Oct 16, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control | Nov 10, 2024 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| Offline Actor-Critic Reinforcement Learning Scales to Large Models | Feb 8, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives | Jan 3, 2023 | Offline RLRecommendation Systems | —Unverified | 0 | 0 |
| Offline Fictitious Self-Play for Competitive Games | Feb 29, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies | May 22, 2025 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare | Oct 10, 2024 | Common Sense ReasoningData Augmentation | —Unverified | 0 | 0 |
| Offline Inverse Reinforcement Learning | Jun 9, 2021 | Data AugmentationImitation Learning | —Unverified | 0 | 0 |
| Offline Model-Based Reinforcement Learning with Anti-Exploration | Aug 20, 2024 | D4RLmodel | —Unverified | 0 | 0 |
| Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization | Jun 15, 2023 | ManagementMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Offline Multi-task Transfer RL with Representational Penalization | Feb 19, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Offline Policy Evaluation and Optimization under Confounding | Nov 29, 2022 | Offline RLOff-policy evaluation | —Unverified | 0 | 0 |
| Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data | Jun 24, 2023 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Offline Policy Optimization in RL with Variance Regularizaton | Dec 29, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Offline Policy Optimization with Variance Regularization | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |