| GymD2D: A Device-to-Device Underlay Cellular Offload Evaluation Platform | Jan 27, 2021 | Deep Reinforcement LearningFriction | CodeCode Available | 1 |
| The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors | Jan 26, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Multi-intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline | Jan 24, 2021 | DecoderDeep Reinforcement Learning | —Unverified | 0 |
| GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning | Jan 24, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Learning Setup Policies: Reliable Transition Between Locomotion Behaviours | Jan 23, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Multi-hop RIS-Empowered Terahertz Communications: A DRL-based Hybrid Beamforming Design | Jan 22, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Theory of Mind for Deep Reinforcement Learning in Hanabi | Jan 22, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Differentiable Trust Region Layers for Deep Reinforcement Learning | Jan 22, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Reinforcement Learning with Spatio-temporal Traffic Forecasting for Data-Driven Base Station Sleep Control | Jan 21, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Robust Reinforcement Learning on State Observations with Learned Optimal Adversary | Jan 21, 2021 | Adversarial Attackcontinuous-control | CodeCode Available | 1 |