| StarCraft II Build Order Optimization using Deep Reinforcement Learning and Monte-Carlo Tree Search | Jun 12, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning | Jun 12, 2020 | Deep Reinforcement Learning, Negation | Unverified | 0 |
| Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch | Jun 12, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| A Brief Look at Generalization in Visual Meta-Reinforcement Learning | Jun 12, 2020 | Deep Reinforcement Learning, Meta Reinforcement Learning | Unverified | 0 |
| Decorrelated Double Q-learning | Jun 12, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| Deep Reinforcement Learning for Neural Control | Jun 12, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning for Electric Transmission Voltage Control | Jun 11, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep reinforcement learning for optical systems: A case study of mode-locked lasers | Jun 10, 2020 | Deep Reinforcement Learning, Navigate | Unverified | 0 |
| Privacy-Cost Management in Smart Meters with Mutual Information-Based Reinforcement Learning | Jun 10, 2020 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning | Jun 10, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Stealing Deep Reinforcement Learning Models for Fun and Profit | Jun 9, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Jun 8, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Online Data Poisoning Attacks | Jun 8, 2020 | Data Poisoning, Deep Reinforcement Learning | Unverified | 0 |
| Randomized Policy Learning for Continuous State and Action MDPs | Jun 8, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Efficient Poverty Mapping using Deep Reinforcement Learning | Jun 7, 2020 | Deep Reinforcement Learning, object-detection | Unverified | 0 |
| AutoPrivacy: Automated Layer-wise Parameter Selection for Secure Neural Network Inference | Jun 7, 2020 | Deep Reinforcement Learning, Privacy Preserving | Unverified | 0 |
| Real-Time Model Calibration with Deep Reinforcement Learning | Jun 7, 2020 | Deep Reinforcement Learning, model | Unverified | 0 |
| Dual Policy Distillation | Jun 7, 2020 | continuous-control, Continuous Control | Code Available | 0 |
| A Multi-Agent Deep Reinforcement Learning Method for Cooperative Load Frequency Control of a Multi-Area Power System | Jun 4, 2020 | Deep Reinforcement Learning | Unverified | 0 |
| Learning to Scan: A Deep Reinforcement Learning Approach for Personalized Scanning in CT Imaging | Jun 3, 2020 | compressed sensing, Computed Tomography (CT) | Unverified | 0 |
| Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent | Jun 2, 2020 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Adversarial Attacks on Reinforcement Learning based Energy Management Systems of Extended Range Electric Delivery Vehicles | Jun 1, 2020 | Deep Reinforcement Learning, energy management | Unverified | 0 |
| Distributed Voltage Regulation of Active Distribution System Based on Enhanced Multi-agent Deep Reinforcement Learning | May 31, 2020 | Clustering, Deep Reinforcement Learning | Unverified | 0 |
| Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios | May 31, 2020 | Autonomous Navigation, Collision Avoidance | Unverified | 0 |
| Intelligent Residential Energy Management System using Deep Reinforcement Learning | May 28, 2020 | Deep Reinforcement Learning, energy management | Unverified | 0 |
| Domain Knowledge Integration By Gradient Matching For Sample-Efficient Reinforcement Learning | May 28, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Revisiting Parameter Sharing in Multi-Agent Deep Reinforcement Learning | May 27, 2020 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 0 |
| The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems | May 27, 2020 | Deep Reinforcement Learning, Starcraft | Unverified | 0 |
| Integrating LEO Satellite and UAV Relaying via Reinforcement Learning for Non-Terrestrial Networks | May 26, 2020 | Deep Reinforcement Learning, Dimensionality Reduction | Unverified | 0 |
| Anomaly Detection Under Controlled Sensing Using Actor-Critic Reinforcement Learning | May 26, 2020 | Anomaly Detection, Decision Making | Unverified | 0 |
| Towards intervention-centric causal reasoning in learning agents | May 26, 2020 | Deep Reinforcement Learning, Meta-Learning | Unverified | 0 |
| Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce | May 25, 2020 | Deep Reinforcement Learning, Diversity | Unverified | 0 |
| Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning | May 25, 2020 | Clustering, Deep Reinforcement Learning | Unverified | 0 |
| Gradient Monitored Reinforcement Learning | May 25, 2020 | Atari Games, continuous-control | Unverified | 0 |
| Formal Methods with a Touch of Magic | May 25, 2020 | Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning Based Power Allocation for D2D Network | May 25, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Policy Entropy for Out-of-Distribution Classification | May 25, 2020 | Benchmarking, Classification | Unverified | 0 |
| Optimization-driven Deep Reinforcement Learning for Robust Beamforming in IRS-assisted Wireless Communications | May 25, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Single-Agent Optimization Through Policy Iteration Using Monte-Carlo Tree Search | May 22, 2020 | Deep Reinforcement Learning | Unverified | 0 |
| Distributed Resource Scheduling for Large-Scale MEC Systems: A Multi-Agent Ensemble Deep Reinforcement Learning with Imitation Acceleration | May 21, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks | May 20, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning for High Level Character Control | May 20, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text | May 19, 2020 | Deep Reinforcement Learning, Instruction Following | Unverified | 0 |
| Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding Behaviors using Deep Reinforcement Learning | May 19, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption | May 19, 2020 | Deep Reinforcement Learning, Few-Shot Learning | Unverified | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Dampen the Stop-and-Go Traffic with Connected and Automated Vehicles -- A Deep Reinforcement Learning Approach | May 17, 2020 | Deep Reinforcement Learning, Position | Unverified | 0 |
| Learning-based Prediction, Rendering and Association Optimization for MEC-enabled Wireless Virtual Reality (VR) Network | May 17, 2020 | Deep Reinforcement Learning | Unverified | 0 |
| Learning Transferable Concepts in Deep Reinforcement Learning | May 16, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Reinforced Coloring for End-to-End Instance Segmentation | May 14, 2020 | Deep Reinforcement Learning, Instance Segmentation | Unverified | 0 |