| Generalizing from a few environments in safety-critical reinforcement learning | Jul 2, 2019 | BlockingDeep Reinforcement Learning | —Unverified | 0 |
| Dynamic Face Video Segmentation via Reinforcement Learning | Jul 2, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| End-to-end Deep Reinforcement Learning Based Coreference Resolution | Jul 1, 2019 | coreference-resolutionCoreference Resolution | —Unverified | 0 |
| Designing Deep Reinforcement Learning for Human Parameter Exploration | Jul 1, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model | Jul 1, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control | Jul 1, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Variational Quantum Circuits for Deep Reinforcement Learning | Jun 30, 2019 | BIG-bench Machine LearningDecision Making | CodeCode Available | 0 |
| Collaboration of AI Agents via Cooperative Multi-Agent Deep Reinforcement Learning | Jun 30, 2019 | counterfactualDeep Reinforcement Learning | —Unverified | 0 |
| Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog | Jun 30, 2019 | Deep Reinforcement LearningOpen-Domain Dialog | —Unverified | 0 |
| On Training Flexible Robots using Deep Reinforcement Learning | Jun 29, 2019 | Deep Reinforcement LearningIndustrial Robots | —Unverified | 0 |
| Learning to Cope with Adversarial Attacks | Jun 28, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Demonstration-Guided Deep Reinforcement Learning of Control Policies for Dexterous Human-Robot Interaction | Jun 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Efficient Navigation of Colloidal Robots in an Unknown Environment via Deep Reinforcement Learning | Jun 26, 2019 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy | Jun 25, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Multi-Agent Deep Reinforcement Learning for Liquidation Strategy Analysis | Jun 24, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Conservative Policy Iteration | Jun 24, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Modern Deep Reinforcement Learning Algorithms | Jun 24, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction | Jun 21, 2019 | Autonomous Drivingcontinuous-control | —Unverified | 0 |
| A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement Learning | Jun 20, 2019 | Autonomous DrivingDecision Making | CodeCode Available | 0 |
| Cooperative Lane Changing via Deep Reinforcement Learning | Jun 20, 2019 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| When Multiple Agents Learn to Schedule: A Distributed Radio Resource Management Framework | Jun 20, 2019 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Global Routing | Jun 20, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Calibrated Model-Based Deep Reinforcement Learning | Jun 19, 2019 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| Multi-user Resource Control with Deep Reinforcement Learning in IoT Edge Computing | Jun 19, 2019 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Reward Prediction Error as an Exploration Objective in Deep RL | Jun 19, 2019 | Atari GamesContinuous Control | —Unverified | 0 |
| DISCO: Influence Maximization Meets Network Embedding and Deep Learning | Jun 18, 2019 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 |
| Language as an Abstraction for Hierarchical Deep Reinforcement Learning | Jun 18, 2019 | Deep Reinforcement LearningInstruction Following | CodeCode Available | 0 |
| Universal Successor Features Based Deep Reinforcement Learning for Navigation | Jun 17, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A gray-box approach for curriculum learning | Jun 17, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Self-Tuning Sectorization: Deep Reinforcement Learning Meets Broadcast Beam Optimization | Jun 14, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Cross-View Policy Learning for Street Navigation | Jun 13, 2019 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Deep Reinforcement Learning for Cyber Security | Jun 13, 2019 | Deep Reinforcement LearningIntrusion Detection | —Unverified | 0 |
| Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Rewards | Jun 13, 2019 | Deep Reinforcement LearningFriction | CodeCode Available | 0 |
| Deep Reinforcement Learning for Unmanned Aerial Vehicle-Assisted Vehicular Networks | Jun 12, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning | Jun 11, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep learning control of artificial avatars in group coordination tasks | Jun 11, 2019 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning with Discrete Normalized Advantage Functions for Resource Management in Network Slicing | Jun 10, 2019 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning | Jun 9, 2019 | Deep Reinforcement LearningGPU | CodeCode Available | 0 |
| Transfer Learning by Modeling a Distribution over Policies | Jun 9, 2019 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Neural Heterogeneous Scheduler | Jun 9, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach | Jun 7, 2019 | Active LearningDeep Reinforcement Learning | —Unverified | 0 |
| Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies | Jun 6, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Multi-objective Optimization | Jun 6, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| An Extensible Interactive Interface for Agent Design | Jun 6, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Deep Q-Learning for Directed Acyclic Graph Generation | Jun 5, 2019 | Deep Reinforcement LearningGraph Generation | —Unverified | 0 |
| On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | Jun 4, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Options as responses: Grounding behavioural hierarchies in multi-agent RL | Jun 4, 2019 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Learning dynamic polynomial proofs | Jun 4, 2019 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning with Low-Complexity Liquid State Machines | Jun 4, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |