| The First AI4TSP Competition: Learning to Solve Stochastic Routing Problems | Jan 25, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Solving Dynamic Graph Problems with Multi-Attention Deep Reinforcement Learning | Jan 13, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Verified Probabilistic Policies for Deep Reinforcement Learning | Jan 10, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Mirror Learning: A Unifying Framework of Policy Optimisation | Jan 7, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation | Jan 5, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Hybrid intelligence for dynamic job-shop scheduling with deep reinforcement learning and attention mechanism | Jan 3, 2022 | Deep Reinforcement LearningGraph Representation Learning | CodeCode Available | 1 |
| SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning | Dec 31, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Lane Change Decision-Making through Deep Reinforcement Learning | Dec 24, 2021 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning | Dec 23, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |