| Reinforcement Learning for Molecular Design Guided by Quantum Mechanics | Feb 18, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge | Feb 18, 2020 | Common Sense Reasoning, Continuous Control | Unverified | 0 |
| Multi-Issue Bargaining With Deep Reinforcement Learning | Feb 18, 2020 | Continuous Control | Unverified | 0 |
| Reinforced active learning for image segmentation | Feb 16, 2020 | Active Learning, Deep Reinforcement Learning | Code Available | 1 |
| Investigating Simple Object Representations in Model-Free Deep Reinforcement Learning | Feb 16, 2020 | Deep Reinforcement Learning, Object | Unverified | 0 |
| Deep RL Agent for a Real-Time Action Strategy Game | Feb 15, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning | Feb 14, 2020 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Deep Reinforcement Learning-Based Beam Tracking for Low-Latency Services in Vehicular Networks | Feb 13, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| MODRL/D-AM: Multiobjective Deep Reinforcement Learning Algorithm Using Decomposition and Attention Model for Multiobjective Optimization | Feb 13, 2020 | Deep Reinforcement Learning, Multiobjective Optimization | Unverified | 0 |
| Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing | Feb 12, 2020 | Atari Games, Decision Making | Unverified | 0 |
| Learning Multi-Agent Coordination through Connectivity-driven Communication | Feb 12, 2020 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| AI Driven Heterogeneous MEC System with UAV Assistance for Dynamic Environment -- Challenges and Solutions | Feb 11, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| On Reward Shaping for Mobile Robot Navigation: A Reinforcement Learning and SLAM Based Approach | Feb 10, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Proficiency Constrained Multi-Agent Reinforcement Learning for Environment-Adaptive Multi UAV-UGV Teaming | Feb 10, 2020 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems | Feb 9, 2020 | Combinatorial Optimization, Decoder | Code Available | 1 |
| Reward Tweaking: Maximizing the Total Reward While Planning for Short Horizons | Feb 9, 2020 | Continuous Control | Unverified | 0 |
| Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling | Feb 8, 2020 | Deep Reinforcement Learning, Image Retrieval | Unverified | 0 |
| RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning | Feb 8, 2020 | Deep Reinforcement Learning, Music Generation | Unverified | 0 |
| Learning Whole-body Motor Skills for Humanoids | Feb 7, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning | Feb 7, 2020 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| Dynamic Energy Dispatch Based on Deep Reinforcement Learning in IoT-Driven Smart Isolated Microgrids | Feb 7, 2020 | Deep Reinforcement Learning, Energy Management | Unverified | 0 |
| Unboxing MAC Protocol Design Optimization Using Deep Learning | Feb 6, 2020 | Deep Learning, Deep Reinforcement Learning | Unverified | 0 |
| Transfer Heterogeneous Knowledge Among Peer-to-Peer Teammates: A Model Distillation Approach | Feb 6, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Soft Hindsight Experience Replay | Feb 6, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| Bootstrapping a DQN Replay Memory with Synthetic Experiences | Feb 4, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |