| Recurrent Off-policy Baselines for Memory-based Continuous Control | Oct 25, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Uniformly Conservative Exploration in Reinforcement Learning | Oct 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement LearningJob Shop Scheduling | CodeCode Available | 1 |
| MARVEL: Raster Manga Vectorization via Primitive-wise Deep Reinforcement Learning | Oct 10, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations | Oct 9, 2021 | Deep Reinforcement LearningStarcraft | CodeCode Available | 1 |
| Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks | Oct 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem | Oct 6, 2021 | DecoderDeep Reinforcement Learning | CodeCode Available | 1 |
| Replay-Guided Adversarial Environment Design | Oct 6, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Continuous-Time Fitted Value Iteration for Robust Policies | Oct 5, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values | Oct 4, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |