| Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound | Jul 15, 2025 | counterfactual, Decision Making | Unverified | 0 |
| Sensing Accuracy Optimization for Multi-UAV SAR Interferometry with Data Offloading | Jul 15, 2025 | Deep Reinforcement Learning, Evolutionary Algorithms | Unverified | 0 |
| LiLM-RDB-SFC: Lightweight Language Model with Relational Database-Guided DRL for Optimized SFC Provisioning | Jul 15, 2025 | Deep Reinforcement Learning, Language Modeling | Unverified | 0 |
| Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks | Jul 13, 2025 | Deep Reinforcement Learning, Fairness | Unverified | 0 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Hierarchical Task Offloading for UAV-Assisted Vehicular Edge Computing via Deep Reinforcement Learning | Jul 8, 2025 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| Beyond Training-time Poisoning: Component-level and Post-training Backdoors in Deep Reinforcement Learning | Jul 7, 2025 | Backdoor Attack, Deep Reinforcement Learning | Unverified | 0 |
| Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains | Jul 2, 2025 | Atari Games, Chatbot | Code Available | 0 |
| Explainable AI for Radar Resource Management: Modified LIME in Deep Reinforcement Learning | Jun 26, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| rQdia: Regularizing Q-Value Distributions With Image Augmentation | Jun 26, 2025 | continuous-control, Continuous Control | Unverified | 0 |