| Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees | Jul 10, 2018 | continuous-control, Continuous Control | Code Available | 0 |
| Video Summarisation by Classification with Deep Reinforcement Learning | Jul 9, 2018 | Classification, Decision Making | Unverified | 0 |
| Financial Trading as a Game: A Deep Reinforcement Learning Approach | Jul 8, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| End-to-End Race Driving with Deep Reinforcement Learning | Jul 6, 2018 | Deep Reinforcement Learning, Domain Adaptation | Unverified | 0 |
| Arcades: A deep model for adaptive decision making in voice controlled smart-home | Jul 5, 2018 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for Doom using Unsupervised Auxiliary Tasks | Jul 5, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Human-level performance in first-person multiplayer games with population-based deep reinforcement learning | Jul 3, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient | Jul 2, 2018 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Deep Reinforcement Learning for NLP | Jul 1, 2018 | Atari Games, coreference-resolution | Unverified | 0 |
| Learning to Drive in a Day | Jul 1, 2018 | Autonomous Driving, Deep Reinforcement Learning | Code Available | 0 |
| Deep Reinforcement Learning in Continuous Action Spaces: a Case Study in the Game of Simulated Curling | Jul 1, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Towards Mixed Optimization for Reinforcement Learning with Program Synthesis | Jul 1, 2018 | Deep Reinforcement Learning, Program Repair | Unverified | 0 |
| Illuminating Generalization in Deep Reinforcement Learning through Procedural Level Generation | Jun 28, 2018 | Clustering, Deep Reinforcement Learning | Code Available | 0 |
| QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation | Jun 27, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Adversarial Active Exploration for Inverse Dynamics Model Learning | Jun 26, 2018 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Accuracy-based Curriculum Learning in Deep Reinforcement Learning | Jun 25, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning: An Overview | Jun 23, 2018 | BIG-bench Machine Learning, Deep Learning | Unverified | 0 |
| Learning-to-Ask: Knowledge Acquisition via 20 Questions | Jun 22, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| A New Approach for Resource Scheduling with Deep Reinforcement Learning | Jun 21, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments | Jun 21, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification | Jun 21, 2018 | Action Segmentation, Classification | Code Available | 0 |
| Sim-to-Real Reinforcement Learning for Deformable Object Manipulation | Jun 20, 2018 | Deep Reinforcement Learning, Deformable Object Manipulation | Code Available | 0 |
| A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning | Jun 20, 2018 | BIG-bench Machine Learning, Deep Reinforcement Learning | Unverified | 0 |
| Learning Policy Representations in Multiagent Systems | Jun 17, 2018 | Clustering, continuous-control | Unverified | 0 |
| Improving width-based planning with compact policies | Jun 15, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Automated Image Data Preprocessing with Deep Reinforcement Learning | Jun 15, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Surprising Negative Results for Generative Adversarial Tree Search | Jun 15, 2018 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Deep Reinforcement Learning for Dynamic Urban Transportation Problems | Jun 14, 2018 | Deep Learning, Deep Reinforcement Learning | Unverified | 0 |
| Multi-Agent Deep Reinforcement Learning with Human Strategies | Jun 12, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Meta-Learning Transferable Active Learning Policies by Deep Reinforcement Learning | Jun 12, 2018 | Active Learning, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for Chinese Zero pronoun Resolution | Jun 10, 2018 | Chinese Zero Pronoun Resolution, Decision Making | Code Available | 0 |
| Randomized Prior Functions for Deep Reinforcement Learning | Jun 8, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Conversational Recommender System | Jun 8, 2018 | Conversational Recommendation, Deep Reinforcement Learning | Code Available | 0 |
| Automatic View Planning with Multi-scale Deep Reinforcement Learning Agents | Jun 8, 2018 | Anatomy, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for General Video Game AI | Jun 6, 2018 | Atari Games, Benchmarking | Code Available | 0 |
| Learning to Understand Goal Specifications by Modelling Reward | Jun 5, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| Relational Deep Reinforcement Learning | Jun 5, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Playing Atari with Six Neurons | Jun 4, 2018 | Atari Games, Decision Making | Code Available | 0 |
| Relational inductive bias for physical construction in humans and machines | Jun 4, 2018 | Deep Reinforcement Learning, Inductive Bias | Unverified | 0 |
| TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning | Jun 4, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| Adversarial Reinforcement Learning Framework for Benchmarking Collision Avoidance Mechanisms in Autonomous Vehicles | Jun 4, 2018 | Autonomous Navigation, Autonomous Vehicles | Unverified | 0 |
| Mitigation of Policy Manipulation Attacks on Deep Q-Networks with Parameter-Space Noise | Jun 4, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Building Advanced Dialogue Managers for Goal-Oriented Dialogue Systems | Jun 3, 2018 | Deep Reinforcement Learning, Goal-Oriented Dialogue Systems | Unverified | 0 |
| DAQN: Deep Auto-encoder and Q-Network | Jun 2, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Efficient Entropy for Policy Gradient with Multidimensional Action Space | Jun 2, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning of Region Proposal Networks for Object Detection | Jun 1, 2018 | Deep Reinforcement Learning, Object | Code Available | 0 |
| GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning | Jun 1, 2018 | Binarization, Deep Reinforcement Learning | Unverified | 0 |
| SINT++: Robust Visual Tracking via Adversarial Positive Instance Generation | Jun 1, 2018 | Deep Reinforcement Learning, Object | Unverified | 0 |
| Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition | Jun 1, 2018 | Action Recognition, Deep Reinforcement Learning | Unverified | 0 |
| Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling | Jun 1, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |