| XSS Adversarial Attacks Based on Deep Reinforcement Learning: A Replication and Extension Study | Feb 26, 2025 | Adversarial Attack, Deep Reinforcement Learning | Code Available | 0 |
| Research on Edge Computing and Cloud Collaborative Resource Scheduling Optimization Based on Deep Reinforcement Learning | Feb 26, 2025 | Cloud Computing, Deep Reinforcement Learning | Unverified | 0 |
| A Multi-Agent DRL-Based Framework for Optimal Resource Allocation and Twin Migration in the Multi-Tier Vehicular Metaverse | Feb 26, 2025 | Deep Reinforcement Learning | Unverified | 0 |
| Target Defense with Multiple Defenders and an Agile Attacker via Residual Policy Learning | Feb 25, 2025 | Deep Reinforcement Learning | Unverified | 0 |
| Applications of deep reinforcement learning to urban transit network design | Feb 25, 2025 | Deep Reinforcement Learning, Metaheuristic Optimization | Unverified | 0 |
| Provable Performance Bounds for Digital Twin-driven Deep Reinforcement Learning in Wireless Networks: A Novel Digital-Twin Bisimulation Metric | Feb 25, 2025 | Deep Reinforcement Learning | Unverified | 0 |
| Controlling dynamics of stochastic systems with deep reinforcement learning | Feb 25, 2025 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Toward 6-DOF Autonomous Underwater Vehicle Energy-Aware Position Control based on Deep Reinforcement Learning: Preliminary Results | Feb 25, 2025 | Deep Reinforcement Learning, Model Predictive Control | Unverified | 0 |
| Event-Based Limit Order Book Simulation under a Neural Hawkes Process: Application in Market-Making | Feb 24, 2025 | Deep Reinforcement Learning | Unverified | 0 |
| Distributed Coordination for Heterogeneous Non-Terrestrial Networks | Feb 24, 2025 | Deep Reinforcement Learning, Scheduling | Unverified | 0 |
| Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation | Feb 24, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Towards Optimal Adversarial Robust Reinforcement Learning with Infinity Measurement Error | Feb 23, 2025 | Adversarial Robustness, Deep Reinforcement Learning | Code Available | 1 |
| Facilitating Emergency Vehicle Passage in Congested Urban Areas Using Multi-agent Deep Reinforcement Learning | Feb 23, 2025 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Exploring Sentiment Manipulation by LLM-Enabled Intelligent Trading Agents | Feb 22, 2025 | Deep Reinforcement Learning, Language Modeling | Unverified | 0 |
| Human-AI Collaboration in Cloud Security: Cognitive Hierarchy-Driven Deep Reinforcement Learning | Feb 22, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| SpikeRL: A Scalable and Energy-efficient Framework for Deep Spiking Reinforcement Learning | Feb 21, 2025 | continuous-control, Continuous Control | Unverified | 0 |
| Hyperspherical Normalization for Scalable Deep Reinforcement Learning | Feb 21, 2025 | continuous-control, Continuous Control | Unverified | 0 |
| SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning | Feb 21, 2025 | Action Generation, Decoder | Unverified | 0 |
| Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Feb 20, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| SPRIG: Stackelberg Perception-Reinforcement Learning with Internal Game Dynamics | Feb 20, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Improving Collision-Free Success Rate For Object Goal Visual Navigation Via Two-Stage Training With Collision Prediction | Feb 19, 2025 | Collision Avoidance, Deep Reinforcement Learning | Unverified | 0 |
| Learning Symbolic Task Decompositions for Multi-Agent Teams | Feb 19, 2025 | Deep Reinforcement Learning | Code Available | 0 |
| Atomic Proximal Policy Optimization for Electric Robo-Taxi Dispatch and Charger Allocation | Feb 19, 2025 | Deep Reinforcement Learning, Scheduling | Unverified | 0 |
| Multi-Target Radar Search and Track Using Sequence-Capable Deep Reinforcement Learning | Feb 19, 2025 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Fighter Jet Navigation and Combat using Deep Reinforcement Learning with Explainable AI | Feb 19, 2025 | counterfactual, Decision Making | Code Available | 0 |
| Navigating Demand Uncertainty in Container Shipping: Deep Reinforcement Learning for Enabling Adaptive and Feasible Master Stowage Planning | Feb 18, 2025 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 0 |
| Finding Optimal Trading History in Reinforcement Learning for Stock Market Trading | Feb 18, 2025 | Deep Reinforcement Learning | Unverified | 0 |
| Learning a High-quality Robotic Wiping Policy Using Systematic Reward Analysis and Visual-Language Model Based Curriculum | Feb 18, 2025 | Deep Reinforcement Learning, Language Modeling | Unverified | 0 |
| A Graph-Enhanced Deep-Reinforcement Learning Framework for the Aircraft Landing Problem | Feb 18, 2025 | Deep Reinforcement Learning, Management | Unverified | 0 |
| NTP-INT: Network Traffic Prediction-Driven In-band Network Telemetry for High-load Switches | Feb 18, 2025 | Deep Reinforcement Learning, Graph Neural Network | Unverified | 0 |
| A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models | Feb 18, 2025 | Deep Reinforcement Learning, Recommendation Systems | Unverified | 0 |
| Communication Strategy on Macro-and-Micro Traffic State in Cooperative Deep Reinforcement Learning for Regional Traffic Signal Control | Feb 18, 2025 | Deep Reinforcement Learning, Traffic Signal Control | Unverified | 0 |
| TSS GAZ PTP: Towards Improving Gumbel AlphaZero with Two-stage Self-play for Multi-constrained Electric Vehicle Routing Problems | Feb 17, 2025 | Combinatorial Optimization, Deep Reinforcement Learning | Unverified | 0 |
| Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning | Feb 17, 2025 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Massively Scaling Explicit Policy-conditioned Value Functions | Feb 17, 2025 | continuous-control, Continuous Control | Unverified | 0 |
| Robot Deformable Object Manipulation via NMPC-generated Demonstrations in Deep Reinforcement Learning | Feb 17, 2025 | Deep Reinforcement Learning, Deformable Object Manipulation | Unverified | 0 |
| Intelligent Mobile AI-Generated Content Services via Interactive Prompt Engineering and Dynamic Service Provisioning | Feb 17, 2025 | Deep Reinforcement Learning, Large Language Model | Unverified | 0 |
| Deep Reinforcement Learning-Based Bidding Strategies for Prosumers Trading in Double Auction-Based Transactive Energy Market | Feb 16, 2025 | Deep Reinforcement Learning | Unverified | 0 |
| Integrating Language Models for Enhanced Network State Monitoring in DRL-Based SFC Provisioning | Feb 16, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Solving Online Resource-Constrained Scheduling for Follow-Up Observation in Astronomy: a Reinforcement Learning Approach | Feb 16, 2025 | Astronomy, Deep Reinforcement Learning | Unverified | 0 |
| Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents | Feb 15, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning | Feb 13, 2025 | Deep Reinforcement Learning | Code Available | 0 |
| AoI-Sensitive Data Forwarding with Distributed Beamforming in UAV-Assisted IoT | Feb 13, 2025 | Deep Reinforcement Learning | Unverified | 0 |
| Reevaluating Policy Gradient Methods for Imperfect-Information Games | Feb 13, 2025 | counterfactual, Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning-Based User Scheduling for Collaborative Perception | Feb 12, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Improve the Training Efficiency of DRL for Wireless Communication Resource Allocation: The Role of Generative Diffusion Models | Feb 11, 2025 | Deep Reinforcement Learning | Unverified | 0 |
| UAV-assisted Joint Mobile Edge Computing and Data Collection via Matching-enabled Deep Reinforcement Learning | Feb 11, 2025 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| PICTS: A Novel Deep Reinforcement Learning Approach for Dynamic P-I Control in Scanning Probe Microscopy | Feb 11, 2025 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| MIGT: Memory Instance Gated Transformer Framework for Financial Portfolio Management | Feb 11, 2025 | Deep Reinforcement Learning, Management | Unverified | 0 |
| SIGMA: Sheaf-Informed Geometric Multi-Agent Pathfinding | Feb 10, 2025 | Collision Avoidance, Deep Reinforcement Learning | Code Available | 0 |