| Generalizing from a few environments in safety-critical reinforcement learning | Jul 2, 2019 | Blocking, Deep Reinforcement Learning | Unverified | 0 |
| Dynamic Face Video Segmentation via Reinforcement Learning | Jul 2, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| End-to-end Deep Reinforcement Learning Based Coreference Resolution | Jul 1, 2019 | coreference-resolution, Coreference Resolution | Unverified | 0 |
| Designing Deep Reinforcement Learning for Human Parameter Exploration | Jul 1, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model | Jul 1, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control | Jul 1, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Variational Quantum Circuits for Deep Reinforcement Learning | Jun 30, 2019 | BIG-bench Machine Learning, Decision Making | Code Available | 0 |
| Collaboration of AI Agents via Cooperative Multi-Agent Deep Reinforcement Learning | Jun 30, 2019 | counterfactual, Deep Reinforcement Learning | Unverified | 0 |
| Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog | Jun 30, 2019 | Deep Reinforcement Learning, Open-Domain Dialog | Unverified | 0 |
| On Training Flexible Robots using Deep Reinforcement Learning | Jun 29, 2019 | Deep Reinforcement Learning, Industrial Robots | Unverified | 0 |
| Learning to Cope with Adversarial Attacks | Jun 28, 2019 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Demonstration-Guided Deep Reinforcement Learning of Control Policies for Dexterous Human-Robot Interaction | Jun 27, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Efficient Navigation of Colloidal Robots in an Unknown Environment via Deep Reinforcement Learning | Jun 26, 2019 | Deep Reinforcement Learning, Navigate | Unverified | 0 |
| Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy | Jun 25, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Multi-Agent Deep Reinforcement Learning for Liquidation Strategy Analysis | Jun 24, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Deep Conservative Policy Iteration | Jun 24, 2019 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Modern Deep Reinforcement Learning Algorithms | Jun 24, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction | Jun 21, 2019 | Autonomous Driving, continuous-control | Unverified | 0 |
| A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement Learning | Jun 20, 2019 | Autonomous Driving, Decision Making | Code Available | 0 |
| Cooperative Lane Changing via Deep Reinforcement Learning | Jun 20, 2019 | Autonomous Vehicles, Deep Reinforcement Learning | Unverified | 0 |
| When Multiple Agents Learn to Schedule: A Distributed Radio Resource Management Framework | Jun 20, 2019 | Deep Reinforcement Learning, Management | Unverified | 0 |
| A Deep Reinforcement Learning Approach for Global Routing | Jun 20, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Calibrated Model-Based Deep Reinforcement Learning | Jun 19, 2019 | Deep Reinforcement Learning, model | Code Available | 0 |
| Multi-user Resource Control with Deep Reinforcement Learning in IoT Edge Computing | Jun 19, 2019 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| Reward Prediction Error as an Exploration Objective in Deep RL | Jun 19, 2019 | Atari Games, Continuous Control | Unverified | 0 |