| DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation | Oct 14, 2024 | Deep Reinforcement LearningModel Predictive Control | —Unverified | 0 |
| Compositional Shielding and Reinforcement Learning for Multi-Agent Systems | Oct 14, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach | Oct 14, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Continual Deep Reinforcement Learning to Prevent Catastrophic Forgetting in Jamming Mitigation | Oct 14, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale | Oct 13, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning | Oct 13, 2024 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 2 |
| Multi-Agent Actor-Critics in Autonomous Cyber Defense | Oct 11, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL | Oct 11, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Exploring Natural Language-Based Strategies for Efficient Number Learning in Children through Reinforcement Learning | Oct 10, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Large Vision Model-Enhanced Digital Twin with Deep Reinforcement Learning for User Association and Load Balancing in Dynamic Wireless Networks | Oct 10, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Neuroplastic Expansion in Deep Reinforcement Learning | Oct 10, 2024 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Masked Generative Priors Improve World Models Sequence Modelling Capabilities | Oct 10, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Variations in Multi-Agent Actor-Critic Frameworks for Joint Optimizations in UAV Swarm Networks: Recent Evolution, Challenges, and Directions | Oct 9, 2024 | Deep Reinforcement LearningTrajectory Planning | —Unverified | 0 |
| AAAI Workshop on AI Planning for Cyber-Physical Systems -- CAIPI24 | Oct 8, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Generative Artificial Intelligence (GAI) for Mobile Communications: A Diffusion Model Perspective | Oct 8, 2024 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Learning-Based Shielding for Safe Autonomy under Unknown Dynamics | Oct 7, 2024 | Deep Reinforcement LearningUncertainty Quantification | —Unverified | 0 |
| Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning | Oct 7, 2024 | Deep Reinforcement LearningFPS Games | —Unverified | 0 |
| Toward Debugging Deep Reinforcement Learning Programs with RLExplorer | Oct 6, 2024 | Deep Reinforcement LearningFault Diagnosis | —Unverified | 0 |
| Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization | Oct 4, 2024 | Deep Reinforcement LearningQuantization | CodeCode Available | 1 |
| Latent Action Priors for Locomotion with Deep Reinforcement Learning | Oct 4, 2024 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Joint Channel Selection using FedDRL in V2X | Oct 3, 2024 | channel selectionDecision Making | —Unverified | 0 |
| Leveraging Event Streams with Deep Reinforcement Learning for End-to-End UAV Tracking | Oct 3, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Semantic-Guided RL for Interpretable Feature Engineering | Oct 3, 2024 | Automated Feature EngineeringDeep Reinforcement Learning | —Unverified | 0 |
| Realizable Continuous-Space Shields for Safe Reinforcement Learning | Oct 2, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Generative Diffusion-based Contract Design for Efficient AI Twins Migration in Vehicular Embodied AI Networks | Oct 2, 2024 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |