| Efficient Entropy for Policy Gradient with Multidimensional Action Space | Jun 2, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Reinforcement Cutting-Agent Learning for Video Object Segmentation | Jun 1, 2018 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning | Jun 1, 2018 | Binarization, Deep Reinforcement Learning | Unverified | 0 |
| SINT++: Robust Visual Tracking via Adversarial Positive Instance Generation | Jun 1, 2018 | Deep Reinforcement Learning, Object | Unverified | 0 |
| SeedNet: Automatic Seed Generation With Deep Reinforcement Learning for Robust Interactive Segmentation | Jun 1, 2018 | Deep Reinforcement Learning, Interactive Segmentation | Unverified | 0 |
| Deep Reinforcement Learning of Region Proposal Networks for Object Detection | Jun 1, 2018 | Deep Reinforcement Learning, Object | Code Available | 0 |
| Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition | Jun 1, 2018 | Action Recognition, Deep Reinforcement Learning | Unverified | 0 |
| Scalable Construction and Reasoning of Massive Knowledge Bases | Jun 1, 2018 | Articles, Deep Reinforcement Learning | Unverified | 0 |
| Mining Evidences for Concept Stock Recommendation | Jun 1, 2018 | Deep Reinforcement Learning, Information Retrieval | Unverified | 0 |
| Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems | Jun 1, 2018 | Deep Reinforcement Learning, Montezuma's Revenge | Unverified | 0 |
| Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling | Jun 1, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update | May 31, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models | May 30, 2018 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 2 |
| Supervised Policy Update for Deep Reinforcement Learning | May 29, 2018 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Playing hard exploration games by watching YouTube | May 29, 2018 | Deep Reinforcement Learning, Montezuma's Revenge | Code Available | 1 |
| Observe and Look Further: Achieving Consistent Performance on Atari | May 29, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation | May 26, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Deep Reinforcement Learning For Sequence to Sequence Models | May 24, 2018 | Abstractive Text Summarization, Caption Generation | Code Available | 1 |
| Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning | May 24, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning of Marked Temporal Point Processes | May 23, 2018 | Deep Reinforcement Learning, Marketing | Code Available | 0 |
| Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents | May 22, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients | May 22, 2018 | Deep Reinforcement Learning, Distributed Optimization | Unverified | 0 |
| Verifiable Reinforcement Learning via Policy Extraction | May 22, 2018 | Deep Reinforcement Learning, Imitation Learning | Code Available | 1 |
| Evolution-Guided Policy Gradient in Reinforcement Learning | May 21, 2018 | continuous-control, Continuous Control | Code Available | 0 |
| Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation | May 21, 2018 | Decoder, Deep Reinforcement Learning | Unverified | 0 |
| Unsupervised Video Object Segmentation for Deep Reinforcement Learning | May 20, 2018 | Atari Games, Decision Making | Code Available | 0 |
| Solving the Rubik's Cube Without Human Knowledge | May 18, 2018 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 0 |
| Language Expansion In Text-Based Games | May 17, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning for Resource Management in Network Slicing | May 17, 2018 | Deep Reinforcement Learning, Management | Unverified | 0 |
| FollowNet: Robot Navigation by Following Natural Language Directions with Deep Reinforcement Learning | May 16, 2018 | Deep Reinforcement Learning, Navigate | Unverified | 0 |
| Optimized Computation Offloading Performance in Virtual Edge Computing Systems via Deep Reinforcement Learning | May 16, 2018 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| Do deep reinforcement learning agents model intentions? | May 15, 2018 | Deep Reinforcement Learning, model | Code Available | 0 |
| General solutions for nonlinear differential equations: a rule-based self-learning approach using deep reinforcement learning | May 13, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes | May 11, 2018 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for Optimal Control of Space Heating | May 10, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Reward Estimation for Variance Reduction in Deep Reinforcement Learning | May 9, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning for Page-wise Recommendations | May 7, 2018 | Deep Reinforcement Learning, Recommendation Systems | Unverified | 0 |
| Ranking for Relevance and Display Preferences in Complex Presentation Layouts | May 7, 2018 | Deep Reinforcement Learning, Learning-To-Rank | Code Available | 0 |
| Deep Reinforcement Learning for Playing 2.5D Fighting Games | May 5, 2018 | Deep Reinforcement Learning, OpenAI Gym | Code Available | 0 |
| Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning | May 4, 2018 | Collision Avoidance, Decision Making | Code Available | 0 |
| Exploration by Distributional Reinforcement Learning | May 4, 2018 | Deep Reinforcement Learning, Distributional Reinforcement Learning | Unverified | 0 |
| Robust Deep Reinforcement Learning for Security and Safety in Autonomous Vehicle Systems | May 2, 2018 | Autonomous Vehicles, Deep Reinforcement Learning | Unverified | 0 |
| Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning | May 1, 2018 | Deep Reinforcement Learning, Distributed Computing | Unverified | 0 |
| Deep Reinforcement Learning to Acquire Navigation Skills for Wheel-Legged Robots in Complex Environments | Apr 27, 2018 | Deep Reinforcement Learning, Navigate | Unverified | 0 |
| Sim-to-Real: Learning Agile Locomotion For Quadruped Robots | Apr 27, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Towards Symbolic Reinforcement Learning with Common Sense | Apr 23, 2018 | Common Sense Reasoning, Deep Reinforcement Learning | Code Available | 0 |
| Learning to Extract Coherent Summary via Deep Reinforcement Learning | Apr 19, 2018 | Deep Reinforcement Learning, Extractive Summarization | Unverified | 0 |
| Cell Selection with Deep Reinforcement Learning in Sparse Mobile Crowdsensing | Apr 19, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| A Study on Overfitting in Deep Reinforcement Learning | Apr 18, 2018 | Deep Reinforcement Learning, Inductive Bias | Code Available | 0 |