| LiLM-RDB-SFC: Lightweight Language Model with Relational Database-Guided DRL for Optimized SFC Provisioning | Jul 15, 2025 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| Sensing Accuracy Optimization for Multi-UAV SAR Interferometry with Data Offloading | Jul 15, 2025 | Deep Reinforcement LearningEvolutionary Algorithms | —Unverified | 0 |
| Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound | Jul 15, 2025 | counterfactualDecision Making | —Unverified | 0 |
| Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks | Jul 13, 2025 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Hierarchical Task Offloading for UAV-Assisted Vehicular Edge Computing via Deep Reinforcement Learning | Jul 8, 2025 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Beyond Training-time Poisoning: Component-level and Post-training Backdoors in Deep Reinforcement Learning | Jul 7, 2025 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains | Jul 2, 2025 | Atari GamesChatbot | CodeCode Available | 0 |
| Explainable AI for Radar Resource Management: Modified LIME in Deep Reinforcement Learning | Jun 26, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| rQdia: Regularizing Q-Value Distributions With Image Augmentation | Jun 26, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning-Based Resource Management in Integrated Sensing and Communication Systems | Jun 25, 2025 | Deep Reinforcement LearningIntegrated sensing and communication | —Unverified | 0 |
| Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management | Jun 25, 2025 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| GymPN: A Library for Decision-Making in Process Management Systems | Jun 25, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design | Jun 24, 2025 | Deep Reinforcement LearningZero-shot Generalization | CodeCode Available | 0 |
| Optimal Design of Experiment for Electrochemical Parameter Identification of Li-ion Battery via Deep Reinforcement Learning | Jun 23, 2025 | Deep Reinforcement LearningExperimental Design | —Unverified | 0 |
| Efficient Beam Selection for ISAC in Cell-Free Massive MIMO via Digital Twin-Assisted Deep Reinforcement Learning | Jun 23, 2025 | Deep Reinforcement LearningGenerative Adversarial Network | —Unverified | 0 |
| Adaptive Social Metaverse Streaming based on Federated Multi-Agent Deep Reinforcement Learning | Jun 19, 2025 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 |
| BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios | Jun 19, 2025 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari GamesBoard Games | CodeCode Available | 0 |
| Joint Spectrum Sensing and Resource Allocation for OFDMA-based Underwater Acoustic Communications | Jun 16, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Learning to Explore in Diverse Reward Settings via Temporal-Difference-Error Maximization | Jun 16, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning | Jun 16, 2025 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method | Jun 16, 2025 | Deep Reinforcement LearningSensor Fusion | —Unverified | 0 |
| Federated Neuroevolution O-RAN: Enhancing the Robustness of Deep Reinforcement Learning xApps | Jun 15, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty | Jun 14, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning | Jun 13, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Joint Beamforming with Extremely Large Scale RIS: A Sequential Multi-Agent A2C Approach | Jun 12, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Patient-Specific Deep Reinforcement Learning for Automatic Replanning in Head-and-Neck Cancer Proton Therapy | Jun 11, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning | Jun 11, 2025 | Deep Reinforcement LearningSequential Decision Making | CodeCode Available | 0 |
| Synergizing Reinforcement Learning and Genetic Algorithms for Neural Combinatorial Optimization | Jun 11, 2025 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Foundation Model-Aided Deep Reinforcement Learning for RIS-Assisted Wireless Communication | Jun 11, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | Jun 11, 2025 | D4RLDeep Reinforcement Learning | —Unverified | 0 |
| Towards Robust Deep Reinforcement Learning against Environmental State Perturbation | Jun 10, 2025 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Modular Recurrence in Contextual MDPs for Universal Morphology Control | Jun 10, 2025 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Jun 10, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation | Jun 10, 2025 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness | Jun 10, 2025 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| An Intelligent Fault Self-Healing Mechanism for Cloud AI Systems via Integration of Large Language Models and Deep Reinforcement Learning | Jun 9, 2025 | Deep Reinforcement LearningLarge Language Model | —Unverified | 0 |
| Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator | Jun 9, 2025 | Deep Reinforcement LearningFederated Learning | CodeCode Available | 0 |
| From Static to Adaptive Defense: Federated Multi-Agent Deep Reinforcement Learning-Driven Moving Target Defense Against DoS Attacks in UAV Swarm Networks | Jun 9, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Deep reinforcement learning for near-deterministic preparation of cubic- and quartic-phase gates in photonic quantum computing | Jun 9, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Interpreting Agent Behaviors in Reinforcement-Learning-Based Cyber-Battle Simulation Platforms | Jun 9, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Energy-efficient Deep Reinforcement Learning-based Network Function Disaggregation in Hybrid Non-terrestrial Open Radio Access Networks | Jun 7, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Deep reinforcement learning-based joint real-time energy scheduling for green buildings with heterogeneous battery energy storage devices | Jun 7, 2025 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Improving choice model specification using reinforcement learning | Jun 6, 2025 | Deep Reinforcement Learningmodel | —Unverified | 0 |
| The Economic Dispatch of Power-to-Gas Systems with Deep Reinforcement Learning:Tackling the Challenge of Delayed Rewards with Long-Term Energy Storage | Jun 6, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Can Artificial Intelligence Trade the Stock Market? | Jun 5, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Autonomous Vehicle Lateral Control Using Deep Reinforcement Learning with MPC-PID Demonstration | Jun 4, 2025 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems | Jun 4, 2025 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals | Jun 4, 2025 | Deep Reinforcement LearningEvolutionary Algorithms | —Unverified | 0 |