| Navigating Demand Uncertainty in Container Shipping: Deep Reinforcement Learning for Enabling Adaptive and Feasible Master Stowage Planning | Feb 18, 2025 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 0 |
| Finding Optimal Trading History in Reinforcement Learning for Stock Market Trading | Feb 18, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Learning a High-quality Robotic Wiping Policy Using Systematic Reward Analysis and Visual-Language Model Based Curriculum | Feb 18, 2025 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| A Graph-Enhanced Deep-Reinforcement Learning Framework for the Aircraft Landing Problem | Feb 18, 2025 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| NTP-INT: Network Traffic Prediction-Driven In-band Network Telemetry for High-load Switches | Feb 18, 2025 | Deep Reinforcement LearningGraph Neural Network | —Unverified | 0 |
| A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models | Feb 18, 2025 | Deep Reinforcement LearningRecommendation Systems | —Unverified | 0 |
| Communication Strategy on Macro-and-Micro Traffic State in Cooperative Deep Reinforcement Learning for Regional Traffic Signal Control | Feb 18, 2025 | Deep Reinforcement LearningTraffic Signal Control | —Unverified | 0 |
| TSS GAZ PTP: Towards Improving Gumbel AlphaZero with Two-stage Self-play for Multi-constrained Electric Vehicle Routing Problems | Feb 17, 2025 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning | Feb 17, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Massively Scaling Explicit Policy-conditioned Value Functions | Feb 17, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Robot Deformable Object Manipulation via NMPC-generated Demonstrations in Deep Reinforcement Learning | Feb 17, 2025 | Deep Reinforcement LearningDeformable Object Manipulation | —Unverified | 0 |
| Intelligent Mobile AI-Generated Content Services via Interactive Prompt Engineering and Dynamic Service Provisioning | Feb 17, 2025 | Deep Reinforcement LearningLarge Language Model | —Unverified | 0 |
| Deep Reinforcement Learning-Based Bidding Strategies for Prosumers Trading in Double Auction-Based Transactive Energy Market | Feb 16, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Integrating Language Models for Enhanced Network State Monitoring in DRL-Based SFC Provisioning | Feb 16, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Solving Online Resource-Constrained Scheduling for Follow-Up Observation in Astronomy: a Reinforcement Learning Approach | Feb 16, 2025 | AstronomyDeep Reinforcement Learning | —Unverified | 0 |
| Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents | Feb 15, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning | Feb 13, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| AoI-Sensitive Data Forwarding with Distributed Beamforming in UAV-Assisted IoT | Feb 13, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Reevaluating Policy Gradient Methods for Imperfect-Information Games | Feb 13, 2025 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning-Based User Scheduling for Collaborative Perception | Feb 12, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Improve the Training Efficiency of DRL for Wireless Communication Resource Allocation: The Role of Generative Diffusion Models | Feb 11, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| UAV-assisted Joint Mobile Edge Computing and Data Collection via Matching-enabled Deep Reinforcement Learning | Feb 11, 2025 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| PICTS: A Novel Deep Reinforcement Learning Approach for Dynamic P-I Control in Scanning Probe Microscopy | Feb 11, 2025 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| MIGT: Memory Instance Gated Transformer Framework for Financial Portfolio Management | Feb 11, 2025 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| SIGMA: Sheaf-Informed Geometric Multi-Agent Pathfinding | Feb 10, 2025 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 0 |