| Revisiting Parameter Sharing in Multi-Agent Deep Reinforcement Learning | May 27, 2020 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 0 |
| Anomaly Detection Under Controlled Sensing Using Actor-Critic Reinforcement Learning | May 26, 2020 | Anomaly Detection, Decision Making | Unverified | 0 |
| Towards intervention-centric causal reasoning in learning agents | May 26, 2020 | Deep Reinforcement Learning, Meta-Learning | Unverified | 0 |
| Integrating LEO Satellite and UAV Relaying via Reinforcement Learning for Non-Terrestrial Networks | May 26, 2020 | Deep Reinforcement Learning, Dimensionality Reduction | Unverified | 0 |
| Deep Reinforcement Learning Based Power Allocation for D2D Network | May 25, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO | May 25, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Optimization-driven Deep Reinforcement Learning for Robust Beamforming in IRS-assisted Wireless Communications | May 25, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Gradient Monitored Reinforcement Learning | May 25, 2020 | Atari Games, continuous-control | Unverified | 0 |
| Policy Entropy for Out-of-Distribution Classification | May 25, 2020 | Benchmarking, Classification | Unverified | 0 |
| Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning | May 25, 2020 | Clustering, Deep Reinforcement Learning | Unverified | 0 |