| Uncertainty Weighted Offline Reinforcement Learning | Jan 1, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization | Mar 31, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning | May 20, 2025 | MathOffline RL | —Unverified | 0 |
| Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents | Apr 3, 2023 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Unsupervised-to-Online Reinforcement Learning | Aug 27, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing | Jun 20, 2024 | Autonomous DrivingData Augmentation | —Unverified | 0 |
| User-Interactive Offline Reinforcement Learning | May 21, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Adaptive Q-Aid for Conditional Supervised Learning in Offline Reinforcement Learning | Feb 3, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Value Penalized Q-Learning for Recommender Systems | Oct 15, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Variational oracle guiding for reinforcement learning | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |