| LLM-Assisted Red Teaming of Diffusion Models through "Failures Are Fated, But Can Be Faded" | Oct 22, 2024 | Deep Reinforcement LearningRed Teaming | —Unverified | 0 |
| RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space | Oct 21, 2024 | ClusteringD4RL | —Unverified | 0 |
| Patrol Security Game: Defending Against Adversary with Freedom in Attack Timing, Location, and Duration | Oct 21, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Offline reinforcement learning for job-shop scheduling problems | Oct 21, 2024 | Combinatorial OptimizationDeep Learning | —Unverified | 0 |
| Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning | Oct 21, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| AssemblyComplete: 3D Combinatorial Construction with Deep Reinforcement Learning | Oct 20, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Hierarchical Reinforced Trader (HRT): A Bi-Level Approach for Optimizing Stock Selection and Execution | Oct 19, 2024 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning | Oct 19, 2024 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 |
| Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks | Oct 19, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Reinforcement Learning in Non-Markov Market-Making | Oct 18, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Streaming Deep Reinforcement Learning Finally Works | Oct 18, 2024 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 3 |
| Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments | Oct 18, 2024 | Autonomous NavigationBenchmarking | CodeCode Available | 1 |
| DRL Optimization Trajectory Generation via Wireless Network Intent-Guided Diffusion Models for Optimizing Resource Allocation | Oct 18, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Interpretable end-to-end Neurosymbolic Reinforcement Learning agents | Oct 18, 2024 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Adversarial Inception Backdoor Attacks against Reinforcement Learning | Oct 17, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games | Oct 17, 2024 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| Deep Reinforcement Learning for Online Optimal Execution Strategies | Oct 17, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| A Hierarchical DRL Approach for Resource Optimization in Multi-RIS Multi-Operator Networks | Oct 16, 2024 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| AoI-Aware Resource Allocation for Smart Multi-QoS Provisioning | Oct 16, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach | Oct 16, 2024 | Deep Reinforcement LearningMeta-Learning | —Unverified | 0 |
| Spectrum Sharing using Deep Reinforcement Learning in Vehicular Networks | Oct 16, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Physical Informed-Inspired Deep Reinforcement Learning Based Bi-Level Programming for Microgrid Scheduling | Oct 15, 2024 | AutoMLComputational Efficiency | —Unverified | 0 |
| Solving The Dynamic Volatility Fitting Problem: A Deep Reinforcement Learning Approach | Oct 15, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Oct 15, 2024 | Deep Reinforcement LearningScheduling | —Unverified | 0 |
| Advanced Persistent Threats (APT) Attribution Using Deep Reinforcement Learning | Oct 15, 2024 | AttributeDeep Reinforcement Learning | —Unverified | 0 |
| DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation | Oct 14, 2024 | Deep Reinforcement LearningModel Predictive Control | —Unverified | 0 |
| Compositional Shielding and Reinforcement Learning for Multi-Agent Systems | Oct 14, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach | Oct 14, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Continual Deep Reinforcement Learning to Prevent Catastrophic Forgetting in Jamming Mitigation | Oct 14, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale | Oct 13, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning | Oct 13, 2024 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 2 |
| Multi-Agent Actor-Critics in Autonomous Cyber Defense | Oct 11, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL | Oct 11, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Exploring Natural Language-Based Strategies for Efficient Number Learning in Children through Reinforcement Learning | Oct 10, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Large Vision Model-Enhanced Digital Twin with Deep Reinforcement Learning for User Association and Load Balancing in Dynamic Wireless Networks | Oct 10, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Neuroplastic Expansion in Deep Reinforcement Learning | Oct 10, 2024 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Masked Generative Priors Improve World Models Sequence Modelling Capabilities | Oct 10, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Variations in Multi-Agent Actor-Critic Frameworks for Joint Optimizations in UAV Swarm Networks: Recent Evolution, Challenges, and Directions | Oct 9, 2024 | Deep Reinforcement LearningTrajectory Planning | —Unverified | 0 |
| AAAI Workshop on AI Planning for Cyber-Physical Systems -- CAIPI24 | Oct 8, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Generative Artificial Intelligence (GAI) for Mobile Communications: A Diffusion Model Perspective | Oct 8, 2024 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Learning-Based Shielding for Safe Autonomy under Unknown Dynamics | Oct 7, 2024 | Deep Reinforcement LearningUncertainty Quantification | —Unverified | 0 |
| Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning | Oct 7, 2024 | Deep Reinforcement LearningFPS Games | —Unverified | 0 |
| Toward Debugging Deep Reinforcement Learning Programs with RLExplorer | Oct 6, 2024 | Deep Reinforcement LearningFault Diagnosis | —Unverified | 0 |
| Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization | Oct 4, 2024 | Deep Reinforcement LearningQuantization | CodeCode Available | 1 |
| Latent Action Priors for Locomotion with Deep Reinforcement Learning | Oct 4, 2024 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Joint Channel Selection using FedDRL in V2X | Oct 3, 2024 | channel selectionDecision Making | —Unverified | 0 |
| Leveraging Event Streams with Deep Reinforcement Learning for End-to-End UAV Tracking | Oct 3, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Semantic-Guided RL for Interpretable Feature Engineering | Oct 3, 2024 | Automated Feature EngineeringDeep Reinforcement Learning | —Unverified | 0 |
| Realizable Continuous-Space Shields for Safe Reinforcement Learning | Oct 2, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Generative Diffusion-based Contract Design for Efficient AI Twins Migration in Vehicular Embodied AI Networks | Oct 2, 2024 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |