| Efficient Entropy for Policy Gradient with Multidimensional Action Space | Jun 2, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Reinforcement Cutting-Agent Learning for Video Object Segmentation | Jun 1, 2018 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning | Jun 1, 2018 | Binarization, Deep Reinforcement Learning | Unverified | 0 |
| SINT++: Robust Visual Tracking via Adversarial Positive Instance Generation | Jun 1, 2018 | Deep Reinforcement Learning, Object | Unverified | 0 |
| SeedNet: Automatic Seed Generation With Deep Reinforcement Learning for Robust Interactive Segmentation | Jun 1, 2018 | Deep Reinforcement Learning, Interactive Segmentation | Unverified | 0 |
| Deep Reinforcement Learning of Region Proposal Networks for Object Detection | Jun 1, 2018 | Deep Reinforcement Learning, Object | Code Available | 0 |
| Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition | Jun 1, 2018 | Action Recognition, Deep Reinforcement Learning | Unverified | 0 |
| Scalable Construction and Reasoning of Massive Knowledge Bases | Jun 1, 2018 | Articles, Deep Reinforcement Learning | Unverified | 0 |
| Mining Evidences for Concept Stock Recommendation | Jun 1, 2018 | Deep Reinforcement Learning, Information Retrieval | Unverified | 0 |
| Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems | Jun 1, 2018 | Deep Reinforcement Learning, Montezuma's Revenge | Unverified | 0 |
| Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling | Jun 1, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update | May 31, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models | May 30, 2018 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 2 |
| Supervised Policy Update for Deep Reinforcement Learning | May 29, 2018 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Playing hard exploration games by watching YouTube | May 29, 2018 | Deep Reinforcement Learning, Montezuma's Revenge | Code Available | 1 |
| Observe and Look Further: Achieving Consistent Performance on Atari | May 29, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation | May 26, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Deep Reinforcement Learning For Sequence to Sequence Models | May 24, 2018 | Abstractive Text Summarization, Caption Generation | Code Available | 1 |
| Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning | May 24, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning of Marked Temporal Point Processes | May 23, 2018 | Deep Reinforcement Learning, Marketing | Code Available | 0 |
| Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents | May 22, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients | May 22, 2018 | Deep Reinforcement Learning, Distributed Optimization | Unverified | 0 |
| Verifiable Reinforcement Learning via Policy Extraction | May 22, 2018 | Deep Reinforcement Learning, Imitation Learning | Code Available | 1 |
| Evolution-Guided Policy Gradient in Reinforcement Learning | May 21, 2018 | continuous-control, Continuous Control | Code Available | 0 |