| Sim2Real for Peg-Hole Insertion with Eye-in-Hand Camera | May 29, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Reinforcement learning for real autonomous mobile robot navigation in indoor environments | May 28, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Domain Knowledge Integration By Gradient Matching For Sample-Efficient Reinforcement Learning | May 28, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Intelligent Residential Energy Management System using Deep Reinforcement Learning | May 28, 2020 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems | May 27, 2020 | Deep Reinforcement LearningStarcraft | —Unverified | 0 |
| Revisiting Parameter Sharing in Multi-Agent Deep Reinforcement Learning | May 27, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Anomaly Detection Under Controlled Sensing Using Actor-Critic Reinforcement Learning | May 26, 2020 | Anomaly DetectionDecision Making | —Unverified | 0 |
| Towards intervention-centric causal reasoning in learning agents | May 26, 2020 | Deep Reinforcement LearningMeta-Learning | —Unverified | 0 |
| Integrating LEO Satellite and UAV Relaying via Reinforcement Learning for Non-Terrestrial Networks | May 26, 2020 | Deep Reinforcement LearningDimensionality Reduction | —Unverified | 0 |
| Deep Reinforcement Learning Based Power Allocation for D2D Network | May 25, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO | May 25, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Optimization-driven Deep Reinforcement Learning for Robust Beamforming in IRS-assisted Wireless Communications | May 25, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Gradient Monitored Reinforcement Learning | May 25, 2020 | Atari Gamescontinuous-control | —Unverified | 0 |
| Policy Entropy for Out-of-Distribution Classification | May 25, 2020 | BenchmarkingClassification | —Unverified | 0 |
| Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning | May 25, 2020 | ClusteringDeep Reinforcement Learning | —Unverified | 0 |
| Formal Methods with a Touch of Magic | May 25, 2020 | Deep Reinforcement Learning | —Unverified | 0 |
| Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce | May 25, 2020 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Single-Agent Optimization Through Policy Iteration Using Monte-Carlo Tree Search | May 22, 2020 | Deep Reinforcement Learning | —Unverified | 0 |
| Distributed Resource Scheduling for Large-Scale MEC Systems: A Multi-Agent Ensemble Deep Reinforcement Learning with Imitation Acceleration | May 21, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Decentralized Deep Reinforcement Learning for a Distributed and Adaptive Locomotion Controller of a Hexapod Robot | May 21, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks | May 20, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for High Level Character Control | May 20, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption | May 19, 2020 | Deep Reinforcement LearningFew-Shot Learning | —Unverified | 0 |
| Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding Behaviors using Deep Reinforcement Learning | May 19, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Ultrasound Video Summarization using Deep Reinforcement Learning | May 19, 2020 | Deep Reinforcement LearningDiagnostic | CodeCode Available | 1 |