| Consequentialist conditional cooperation in social dilemmas with imperfect information | Oct 19, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| The Effects of Memory Replay in Reinforcement Learning | Oct 18, 2017 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Asymmetric Actor Critic for Image-Based Robot Learning | Oct 18, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning | Oct 17, 2017 | Bayesian Optimization, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations | Oct 10, 2017 | Cloud Computing, Deep Reinforcement Learning | Unverified | 0 |
| Deep Abstract Q-Networks | Oct 2, 2017 | Deep Reinforcement Learning, Montezuma's Revenge | Unverified | 0 |
| Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight | Oct 2, 2017 | Autonomous Vehicles, Decision Making | Code Available | 0 |
| Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning | Oct 1, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Attention-Aware Deep Reinforcement Learning for Video Face Recognition | Oct 1, 2017 | Deep Reinforcement Learning, Face Recognition | Unverified | 0 |
| Vision-based deep execution monitoring | Sep 29, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation | Sep 29, 2017 | Deep Reinforcement Learning, Navigate | Code Available | 0 |
| Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces | Sep 28, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning | Sep 28, 2017 | Collision Avoidance, Deep Reinforcement Learning | Code Available | 0 |
| Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning | Sep 25, 2017 | continuous-control, Continuous Control | Unverified | 0 |
| Learning Unmanned Aerial Vehicle Control for Autonomous Target Following | Sep 24, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World | Sep 22, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning | Sep 21, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks | Sep 20, 2017 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| A Deep-Reinforcement Learning Approach for Software-Defined Networking Routing Optimization | Sep 20, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes | Sep 19, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning that Matters | Sep 19, 2017 | Atari Games, Continuous Control | Code Available | 0 |
| Guided Deep Reinforcement Learning for Swarm Systems | Sep 18, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models | Sep 18, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for Conversational AI | Sep 15, 2017 | Deep Learning, Deep Reinforcement Learning | Code Available | 0 |
| Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning | Sep 15, 2017 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Shared Learning : Enhancing Reinforcement in Q-Ensembles | Sep 14, 2017 | Atari Games, continuous-control | Unverified | 0 |
| Towards personalized human AI interaction - adapting the behavior of AI agents using neural signatures of subjective interest | Sep 14, 2017 | AI Agent, Brain Computer Interface | Unverified | 0 |
| A Study of AI Population Dynamics with Million-agent Reinforcement Learning | Sep 13, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning | Sep 12, 2017 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Explore, Exploit or Listen: Combining Human Feedback and Policy Model to Speed up Deep Reinforcement Learning in 3D Worlds | Sep 12, 2017 | Deep Reinforcement Learning, Minecraft | Unverified | 0 |
| Deep Reinforcement Learning with Surrogate Agent-Environment Interface | Sep 12, 2017 | continuous-control, Continuous Control | Unverified | 0 |
| Autonomous Quadrotor Landing using Deep Reinforcement Learning | Sep 11, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Prosocial learning agents solve generalized Stag Hunts better than selfish ones | Sep 8, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 0 |
| A Deep Reinforcement Learning Chatbot | Sep 7, 2017 | Chatbot, Deep Reinforcement Learning | Unverified | 0 |
| Formulation of Deep Reinforcement Learning Architecture Toward Autonomous Driving for On-Ramp Merge | Sep 7, 2017 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| BOOK: Storing Algorithm-Invariant Episodes for Deep Reinforcement Learning | Sep 5, 2017 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| BIBI System Description: Building with CNNs and Breaking with Deep Reinforcement Learning | Sep 1, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Reinforcement Mechanism Design for e-commerce | Aug 25, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Fake News in Social Networks | Aug 21, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| A Deep Q-Network for the Beer Game: A Deep Reinforcement Learning algorithm to Solve Inventory Optimization Problems | Aug 20, 2017 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method | Aug 20, 2017 | 3D Bin Packing, Combinatorial Optimization | Unverified | 0 |
| A Brief Survey of Deep Reinforcement Learning | Aug 19, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions | Aug 18, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| StarCraft II: A New Challenge for Reinforcement Learning | Aug 16, 2017 | Deep Reinforcement Learning, Real-Time Strategy Games | Code Available | 0 |
| Deep Reinforcement Learning for High Precision Assembly Tasks | Aug 14, 2017 | Deep Reinforcement Learning, Position | Unverified | 0 |
| A Machine Learning Approach to Routing | Aug 10, 2017 | BIG-bench Machine Learning, Deep Reinforcement Learning | Unverified | 0 |
| Attention-Aware Face Hallucination via Deep Reinforcement Learning | Aug 10, 2017 | Deep Reinforcement Learning, Face Hallucination | Unverified | 0 |
| Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control | Aug 10, 2017 | continuous-control, Continuous Control | Code Available | 0 |
| Learning how to Active Learn: A Deep Reinforcement Learning Approach | Aug 8, 2017 | Active Learning, Deep Reinforcement Learning | Code Available | 0 |
| Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning | Aug 8, 2017 | Deep Reinforcement Learning, model | Code Available | 0 |