| Adversarial Deep Reinforcement Learning in Portfolio Management | Aug 29, 2018 | Deep Reinforcement Learning, Management | Code Available | 1 |
| GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy | May 18, 2025 | Cloud Computing, Deep Reinforcement Learning | Code Available | 1 |
| Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse | Jun 28, 2022 | Continuous Control, Decision Making | Code Available | 1 |
| Generalizing Across Multi-Objective Reward Functions in Deep Reinforcement Learning | Sep 17, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds | Oct 24, 2022 | Deep Reinforcement Learning, Navigate | Code Available | 1 |
| Gradient Surgery for Multi-Task Learning | Jan 19, 2020 | Deep Reinforcement Learning, image-classification | Code Available | 1 |
| Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 |
| Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic Environments | Jan 30, 2022 | Autonomous Vehicles, Decision Making | Code Available | 1 |
| Guided Exploration with Proximal Policy Optimization using a Single Demonstration | Jul 7, 2020 | Deep Reinforcement Learning | Code Available | 1 |
| GymD2D: A Device-to-Device Underlay Cellular Offload Evaluation Platform | Jan 27, 2021 | Deep Reinforcement Learning, Friction | Code Available | 1 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning | Mar 2, 2020 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |
| Hierarchical Needs-driven Agent Learning Systems: From Deep Reinforcement Learning To Diverse Strategies | Feb 25, 2023 | Deep Reinforcement Learning | Code Available | 1 |
| Hierarchical Object-to-Zone Graph for Object Navigation | Sep 5, 2021 | Deep Reinforcement Learning, Object | Code Available | 1 |
| Hieros: Hierarchical Imagination on Structured State Space Sequence World Models | Oct 8, 2023 | Deep Reinforcement Learning | Code Available | 1 |
| High Performance on Atari Games Using Perceptual Control Architecture Without Training | Oct 8, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| H-TSP: Hierarchically Solving the Large-Scale Travelling Salesman Problem | Apr 19, 2023 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Code Available | 1 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |
| Hybrid intelligence for dynamic job-shop scheduling with deep reinforcement learning and attention mechanism | Jan 3, 2022 | Deep Reinforcement Learning, Graph Representation Learning | Code Available | 1 |
| Asset Allocation: From Markowitz to Deep Reinforcement Learning | Jul 14, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO | May 25, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Implementation Matters in Deep RL: A Case Study on PPO and TRPO | May 1, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Adversarially Guided Actor-Critic | Feb 8, 2021 | Deep Reinforcement Learning, Efficient Exploration | Code Available | 1 |
| Improvable Gap Balancing for Multi-Task Learning | Jul 28, 2023 | Deep Reinforcement Learning, Multi-Task Learning | Code Available | 1 |
| Inclined Quadrotor Landing using Deep Reinforcement Learning | Mar 16, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Integrating Deep Reinforcement Learning Networks with Health System Simulations | Jul 21, 2020 | Deep Reinforcement Learning, OpenAI Gym | Code Available | 1 |
| Adversarial Policies: Attacking Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection | Aug 13, 2024 | Deep Reinforcement Learning, Object | Code Available | 1 |
| Adversarial Policy Gradient for Deep Learning Image Augmentation | Sep 9, 2019 | Classification, Deep Learning | Code Available | 1 |
| Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural Networks | Oct 17, 2022 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 |
| Interferobot: aligning an optical interferometer by a reinforcement learning agent | Jun 3, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning | May 23, 2024 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |
| INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer | Feb 4, 2024 | Deep Reinforcement Learning | Code Available | 1 |
| Iterative Amortized Policy Optimization | Oct 20, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| Job Shop Scheduling via Deep Reinforcement Learning: a Sequence to Sequence approach | Aug 3, 2023 | Combinatorial Optimization, Decoder | Code Available | 1 |
| Joint Deep Reinforcement Learning and Unfolding: Beam Selection and Precoding for mmWave Multiuser MIMO with Lens Arrays | Jan 5, 2021 | Deep Reinforcement Learning | Code Available | 1 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 1 |
| Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration | Dec 1, 2020 | Deep Reinforcement Learning | Code Available | 1 |
| Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments | Oct 18, 2024 | Autonomous Navigation, Benchmarking | Code Available | 1 |
| Learning 2-opt Heuristics for the Traveling Salesman Problem via Deep Reinforcement Learning | Apr 3, 2020 | Deep Learning, Deep Reinforcement Learning | Code Available | 1 |
| CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion | May 7, 2020 | Deep Reinforcement Learning, Motion Synthesis | Code Available | 1 |
| Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations | Sep 28, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Learning Decision Trees as Amortized Structure Inference | Mar 10, 2025 | Anomaly Detection, Deep Reinforcement Learning | Code Available | 1 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 |
| Learning Generalizable Policy for Obstacle-Aware Autonomous Drone Racing | Nov 6, 2024 | Deep Reinforcement Learning, Drone navigation | Code Available | 1 |
| Learning Guidance Rewards with Trajectory-space Smoothing | Oct 23, 2020 | Attribute, Deep Reinforcement Learning | Code Available | 1 |
| Learning Improvement Heuristics for Solving Routing Problems | Dec 12, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| Learning Large Neighborhood Search Policy for Integer Programming | Nov 1, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Learning Off-Policy with Online Planning | Aug 23, 2020 | ARC, Continuous Control | Code Available | 1 |
| Continuous Deep Q-Learning with Model-based Acceleration | Mar 2, 2016 | continuous-control, Continuous Control | Code Available | 1 |