| Massively Scaling Explicit Policy-conditioned Value Functions | Feb 17, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Intelligent Mobile AI-Generated Content Services via Interactive Prompt Engineering and Dynamic Service Provisioning | Feb 17, 2025 | Deep Reinforcement LearningLarge Language Model | —Unverified | 0 |
| Robot Deformable Object Manipulation via NMPC-generated Demonstrations in Deep Reinforcement Learning | Feb 17, 2025 | Deep Reinforcement LearningDeformable Object Manipulation | —Unverified | 0 |
| TSS GAZ PTP: Towards Improving Gumbel AlphaZero with Two-stage Self-play for Multi-constrained Electric Vehicle Routing Problems | Feb 17, 2025 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning-Based Bidding Strategies for Prosumers Trading in Double Auction-Based Transactive Energy Market | Feb 16, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Integrating Language Models for Enhanced Network State Monitoring in DRL-Based SFC Provisioning | Feb 16, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Solving Online Resource-Constrained Scheduling for Follow-Up Observation in Astronomy: a Reinforcement Learning Approach | Feb 16, 2025 | AstronomyDeep Reinforcement Learning | —Unverified | 0 |
| Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents | Feb 15, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning | Feb 13, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| AoI-Sensitive Data Forwarding with Distributed Beamforming in UAV-Assisted IoT | Feb 13, 2025 | Deep Reinforcement Learning | —Unverified | 0 |