| Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments | May 11, 2020 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement LearningJob Shop Scheduling | CodeCode Available | 1 | 5 |
| Deep Generalized Schrödinger Bridge | Sep 20, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| DeepMind Lab2D | Nov 13, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 | 5 |
| Adversarial Policies: Attacking Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Adversarial Policy Gradient for Deep Learning Image Augmentation | Sep 9, 2019 | ClassificationDeep Learning | CodeCode Available | 1 | 5 |
| A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem | Jun 30, 2017 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |