| Reward Tweaking: Maximizing the Total Reward While Planning for Short Horizons | Feb 9, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling | Feb 8, 2020 | Deep Reinforcement LearningImage Retrieval | —Unverified | 0 |
| RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning | Feb 8, 2020 | Deep Reinforcement LearningMusic Generation | —Unverified | 0 |
| Learning Whole-body Motor Skills for Humanoids | Feb 7, 2020 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning | Feb 7, 2020 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Dynamic Energy Dispatch Based on Deep Reinforcement Learning in IoT-Driven Smart Isolated Microgrids | Feb 7, 2020 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Unboxing MAC Protocol Design Optimization Using Deep Learning | Feb 6, 2020 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 |
| Transfer Heterogeneous Knowledge Among Peer-to-Peer Teammates: A Model Distillation Approach | Feb 6, 2020 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Soft Hindsight Experience Replay | Feb 6, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Bootstrapping a DQN Replay Memory with Synthetic Experiences | Feb 4, 2020 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |