| Automatic Reward Shaping from Confounded Offline Data | May 16, 2025 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Hierarchical Reinforced Trader (HRT): A Bi-Level Approach for Optimizing Stock Selection and Execution | Oct 19, 2024 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning | Oct 23, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Full Gradient Deep Reinforcement Learning for Average-Reward Criterion | Apr 7, 2023 | Deep Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 | 0 |
| Dealing with Limited Backhaul Capacity in Millimeter Wave Systems: A Deep Reinforcement Learning Approach | Dec 27, 2018 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| Hierarchical Reinforcement Learning of Locomotion Policies in Response to Approaching Objects: A Preliminary Study | Mar 20, 2022 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences | Feb 5, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Hierarchical Task Offloading for UAV-Assisted Vehicular Edge Computing via Deep Reinforcement Learning | Jul 8, 2025 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 | 0 |
| Modified DDPG car-following model with a real-world human driving experience with CARLA simulator | Dec 29, 2021 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| DDPG based on multi-scale strokes for financial time series trading strategy | Jun 5, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |