| Deep Reinforcement Learning from Self-Play in Imperfect-Information Games | Mar 3, 2016 | Card GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Continuous Deep Q-Learning with Model-based Acceleration | Mar 2, 2016 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Asynchronous Methods for Deep Reinforcement Learning | Feb 4, 2016 | Atari GamesCPU | CodeCode Available | 1 |
| Multiagent Cooperation and Competition with Deep Reinforcement Learning | Nov 27, 2015 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning in Parameterized Action Space | Nov 13, 2015 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Reinforcement Learning with Double Q-learning | Sep 22, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Continuous control with deep reinforcement learning | Sep 9, 2015 | Action Detectioncontinuous-control | CodeCode Available | 1 |
| Giraffe: Using Deep Reinforcement Learning to Play Chess | Sep 4, 2015 | BIG-bench Machine LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Deep Recurrent Q-Learning for Partially Observable MDPs | Jul 23, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning | Jun 6, 2015 | Bayesian InferenceDeep Reinforcement Learning | CodeCode Available | 1 |
| Playing Atari with Deep Reinforcement Learning | Dec 19, 2013 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound | Jul 15, 2025 | counterfactualDecision Making | —Unverified | 0 |
| Sensing Accuracy Optimization for Multi-UAV SAR Interferometry with Data Offloading | Jul 15, 2025 | Deep Reinforcement LearningEvolutionary Algorithms | —Unverified | 0 |
| LiLM-RDB-SFC: Lightweight Language Model with Relational Database-Guided DRL for Optimized SFC Provisioning | Jul 15, 2025 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks | Jul 13, 2025 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Hierarchical Task Offloading for UAV-Assisted Vehicular Edge Computing via Deep Reinforcement Learning | Jul 8, 2025 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Beyond Training-time Poisoning: Component-level and Post-training Backdoors in Deep Reinforcement Learning | Jul 7, 2025 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains | Jul 2, 2025 | Atari GamesChatbot | CodeCode Available | 0 |
| rQdia: Regularizing Q-Value Distributions With Image Augmentation | Jun 26, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Explainable AI for Radar Resource Management: Modified LIME in Deep Reinforcement Learning | Jun 26, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| GymPN: A Library for Decision-Making in Process Management Systems | Jun 25, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management | Jun 25, 2025 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Learning-Based Resource Management in Integrated Sensing and Communication Systems | Jun 25, 2025 | Deep Reinforcement LearningIntegrated sensing and communication | —Unverified | 0 |
| TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design | Jun 24, 2025 | Deep Reinforcement LearningZero-shot Generalization | CodeCode Available | 0 |
| Optimal Design of Experiment for Electrochemical Parameter Identification of Li-ion Battery via Deep Reinforcement Learning | Jun 23, 2025 | Deep Reinforcement LearningExperimental Design | —Unverified | 0 |