| Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning | Jun 12, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Hierarchical Reinforcement Learning for Swarm Confrontation with High Uncertainty | Jun 12, 2024 | Deep Reinforcement LearningHierarchical Reinforcement Learning | CodeCode Available | 0 |
| Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling | Jun 11, 2024 | Deep Reinforcement LearningJob Shop Scheduling | —Unverified | 0 |
| Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning | Jun 11, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models | Jun 11, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 |
| DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach | Jun 11, 2024 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots | Jun 10, 2024 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Verification-Guided Shielding for Deep Reinforcement Learning | Jun 10, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning? | Jun 10, 2024 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Multi-attribute Auction-based Resource Allocation for Twins Migration in Vehicular Metaverses: A GPT-based DRL Approach | Jun 8, 2024 | AttributeDeep Reinforcement Learning | —Unverified | 0 |
| Online Policy Distillation with Decision-Attention | Jun 8, 2024 | Deep Reinforcement LearningKnowledge Distillation | —Unverified | 0 |
| ChatPCG: Large Language Model-Driven Reward Design for Procedural Content Generation | Jun 7, 2024 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learning | Jun 7, 2024 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| Optimizing Automatic Differentiation with Deep Reinforcement Learning | Jun 7, 2024 | Computational EfficiencyDeep Reinforcement Learning | —Unverified | 0 |
| Probabilistic Perspectives on Error Minimization in Adversarial Reinforcement Learning | Jun 7, 2024 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| Sim-to-Real Transfer of Deep Reinforcement Learning Agents for Online Coverage Path Planning | Jun 7, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Stochastic Dynamic Network Utility Maximization with Application to Disaster Response | Jun 6, 2024 | Deep Reinforcement LearningDisaster Response | —Unverified | 0 |
| Exploring Pessimism and Optimism Dynamics in Deep Reinforcement Learning | Jun 6, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model | Jun 6, 2024 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning | Jun 4, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Verifying the Generalization of Deep Learning to Out-of-Distribution Domains | Jun 4, 2024 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 |
| By Fair Means or Foul: Quantifying Collusion in a Market Simulation with Deep Reinforcement Learning | Jun 4, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies | Jun 4, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Improving Generalization in Aerial and Terrestrial Mobile Robots Control Through Delayed Policy Learning | Jun 4, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning Behavioral Mode Switching Using Optimal Control Based on a Latent Space Objective | Jun 3, 2024 | Deep Reinforcement LearningDimensionality Reduction | —Unverified | 0 |