| Deep Reinforcement learning for real autonomous mobile robot navigation in indoor environments | May 28, 2020 | Continuous Control | Code Available | 1 |
| A2C is a special case of PPO | May 18, 2022 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning For Sequence to Sequence Models | May 24, 2018 | Abstractive Text Summarization, Caption Generation | Code Available | 1 |
| A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch | Jul 26, 2023 | Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning for Time Allocation and Directional Transmission in Joint Radar-Communication | May 19, 2022 | Autonomous Vehicles, Decision Making Under Uncertainty | Code Available | 1 |
| Deep Reinforcement Learning from Self-Play in Imperfect-Information Games | Mar 3, 2016 | Card Games, Deep Reinforcement Learning | Code Available | 1 |
| Cryptocurrency Portfolio Management with Deep Reinforcement Learning | Dec 5, 2016 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems | Feb 9, 2020 | Combinatorial Optimization, Decoder | Code Available | 1 |
| Beacon, a lightweight deep reinforcement learning benchmark library for flow control | Feb 27, 2024 | Benchmarking, CPU | Code Available | 1 |
| Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning | Feb 7, 2024 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement Learning, Efficient Exploration | Code Available | 1 |
| Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork | Jun 19, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone | Dec 22, 2021 | Combinatorial Optimization, Computational Efficiency | Code Available | 1 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling | May 15, 2020 | Deep Reinforcement Learning, GPU | Code Available | 1 |
| Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints | Apr 18, 2023 | Benchmarking, Deep Reinforcement Learning | Code Available | 1 |
| Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments | May 11, 2020 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 1 |
| Learning Dexterous Grasping with Object-Centric Visual Affordances | Sep 3, 2020 | Deep Reinforcement Learning, Object | Code Available | 1 |
| AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning | Mar 2, 2020 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | Continuous Control | Code Available | 1 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Jul 27, 2022 | Data Augmentation, Deep Reinforcement Learning | Code Available | 1 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 1 |
| Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning | Dec 27, 2023 | Autonomous Driving, Decision Making | Code Available | 1 |
| Automating DBSCAN via Deep Reinforcement Learning | Aug 9, 2022 | Clustering, Computational Efficiency | Code Available | 1 |