| CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms | Jun 13, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Optimizing Deep Reinforcement Learning for Adaptive Robotic Arm Control | Jun 12, 2024 | Deep Reinforcement LearningHyperparameter Optimization | —Unverified | 0 |
| Deep reinforcement learning with positional context for intraday trading | Jun 12, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Adaptive Swarm Mesh Refinement using Deep Reinforcement Learning with Local Rewards | Jun 12, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning to Disentangle Multiqubit Quantum States from Partial Observations | Jun 12, 2024 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 0 |
| Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning | Jun 12, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Hierarchical Reinforcement Learning for Swarm Confrontation with High Uncertainty | Jun 12, 2024 | Deep Reinforcement LearningHierarchical Reinforcement Learning | CodeCode Available | 0 |
| Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling | Jun 11, 2024 | Deep Reinforcement LearningJob Shop Scheduling | —Unverified | 0 |
| Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning | Jun 11, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models | Jun 11, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 |