| Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks | Sep 14, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Deep Deterministic Portfolio Optimization | Mar 13, 2020 | Deep Reinforcement Learning, Portfolio Optimization | Code Available | 1 |
| Deep Intrinsically Motivated Exploration in Continuous Control | Oct 1, 2022 | continuous-control, Continuous Control | Code Available | 1 |
| Deep Lagrangian Networks for end-to-end learning of energy-based control for under-actuated systems | Jul 10, 2019 | Deep Learning, Deep Reinforcement Learning | Code Available | 1 |
| DeepMind Lab2D | Nov 13, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers | Nov 22, 2024 | Avg, Deep Reinforcement Learning | Code Available | 1 |
| A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric Vehicles | Feb 22, 2022 | Deep Reinforcement Learning, energy management | Code Available | 1 |
| Deep Reinforcement Learning at the Edge of the Statistical Precipice | Aug 30, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs A2C | Jul 19, 2024 | Deep Reinforcement Learning | Code Available | 1 |
| Deep-Reinforcement-Learning-Based AoI-Aware Resource Allocation for RIS-Aided IoV Networks | Jun 17, 2024 | Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning-based Intelligent Traffic Signal Controls with Optimized CO2 emissions | Oct 19, 2023 | Deep Reinforcement Learning, Q-Learning | Code Available | 1 |
| Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach | Apr 26, 2022 | Deep Reinforcement Learning, Management | Code Available | 1 |
| Deep-Reinforcement-Learning-based Path Planning for Industrial Robots using Distance Sensors as Observation | Jan 14, 2023 | Deep Reinforcement Learning, Industrial Robots | Code Available | 1 |
| Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks | Oct 13, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning for Active Human Pose Estimation | Jan 7, 2020 | 3D Human Pose Estimation, Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning for Adaptive Exploration of Unknown Environments | May 4, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| A Competition Winning Deep Reinforcement Learning Agent in microRTS | Feb 12, 2024 | Deep Reinforcement Learning, Imitation Learning | Code Available | 1 |
| Deep Reinforcement Learning for Band Selection in Hyperspectral Image Classification | Mar 15, 2021 | Classification, Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning for Cost-Effective Medical Diagnosis | Feb 20, 2023 | Anomaly Detection, Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning for Entity Alignment | Mar 7, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| A Comprehensive Survey on Self-Interpretable Neural Networks | Jan 26, 2025 | Deep Reinforcement Learning, Survey | Code Available | 1 |
| RL-I2IT: Image-to-Image Translation with Deep Reinforcement Learning | Sep 24, 2023 | Auxiliary Learning, Decision Making | Code Available | 1 |
| A DRL-based Multiagent Cooperative Control Framework for CAV Networks: a Graphic Convolution Q Network | Oct 12, 2020 | Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning for Process Synthesis | Sep 23, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning | Jun 3, 2021 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement learning for real autonomous mobile robot navigation in indoor environments | May 28, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| A2C is a special case of PPO | May 18, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Deep Reinforcement Learning For Sequence to Sequence Models | May 24, 2018 | Abstractive Text Summarization, Caption Generation | Code Available | 1 |
| A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch | Jul 26, 2023 | Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning for Time Allocation and Directional Transmission in Joint Radar-Communication | May 19, 2022 | Autonomous Vehicles, Decision Making Under Uncertainty | Code Available | 1 |
| Deep Reinforcement Learning from Self-Play in Imperfect-Information Games | Mar 3, 2016 | Card Games, Deep Reinforcement Learning | Code Available | 1 |
| Cryptocurrency Portfolio Management with Deep Reinforcement Learning | Dec 5, 2016 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems | Feb 9, 2020 | Combinatorial Optimization, Decoder | Code Available | 1 |
| Beacon, a lightweight deep reinforcement learning benchmark library for flow control | Feb 27, 2024 | Benchmarking, CPU | Code Available | 1 |
| Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning | Feb 7, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement Learning, Efficient Exploration | Code Available | 1 |
| Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork | Jun 19, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone | Dec 22, 2021 | Combinatorial Optimization, Computational Efficiency | Code Available | 1 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling | May 15, 2020 | Deep Reinforcement Learning, GPU | Code Available | 1 |
| Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints | Apr 18, 2023 | Benchmarking, Deep Reinforcement Learning | Code Available | 1 |
| Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments | May 11, 2020 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 1 |
| Learning Dexterous Grasping with Object-Centric Visual Affordances | Sep 3, 2020 | Deep Reinforcement Learning, Object | Code Available | 1 |
| AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning | Mar 2, 2020 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-control, Continuous Control | Code Available | 1 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Jul 27, 2022 | Data Augmentation, Deep Reinforcement Learning | Code Available | 1 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 1 |
| Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning | Dec 27, 2023 | Autonomous Driving, Decision Making | Code Available | 1 |
| Automating DBSCAN via Deep Reinforcement Learning | Aug 9, 2022 | Clustering, Computational Efficiency | Code Available | 1 |