| Trust Region-Guided Proximal Policy Optimization | Jan 29, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Designing a Multi-Objective Reward Function for Creating Teams of Robotic Bodyguards Using Deep Reinforcement Learning | Jan 28, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Making Deep Q-learning methods robust to time discretization | Jan 28, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift | Jan 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Action Robust Reinforcement Learning and Applications in Continuous Control | Jan 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Model-based Deep Reinforcement Learning for Dynamic Portfolio Optimization | Jan 25, 2019 | Data AugmentationDeep Reinforcement Learning | —Unverified | 0 |
| Emergent Linguistic Phenomena in Multi-Agent Communication Games | Jan 25, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Dynamic Measurement Scheduling for Event Forecasting using Deep RL | Jan 24, 2019 | Deep Reinforcement LearningICU Mortality | CodeCode Available | 0 |
| Combinational Q-Learning for Dou Di Zhu | Jan 24, 2019 | Atari GamesCard Games | CodeCode Available | 0 |
| Federated Deep Reinforcement Learning | Jan 24, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Never Forget: Balancing Exploration and Exploitation via Learning Optical Flow | Jan 24, 2019 | Deep Reinforcement LearningOptical Flow Estimation | —Unverified | 0 |
| Distillation Strategies for Proximal Policy Optimization | Jan 23, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Robust Recovery Controller for a Quadrupedal Robot using Deep Reinforcement Learning | Jan 22, 2019 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target | Jan 22, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Learning retrosynthetic planning through self-play | Jan 19, 2019 | Deep Reinforcement LearningMulti-step retrosynthesis | —Unverified | 0 |
| On-Policy Trust Region Policy Optimisation with Replay Buffers | Jan 18, 2019 | Continuous ControlDeep Reinforcement Learning | CodeCode Available | 0 |
| Evolutionarily-Curated Curriculum Learning for Deep Reinforcement Learning Agents | Jan 16, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 |
| Improving Sepsis Treatment Strategies by Combining Deep and Kernel-Based Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 |
| Energy-Efficient Thermal Comfort Control in Smart Buildings via Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Improving Coordination in Small-Scale Multi-Agent Deep Reinforcement Learning through Memory-driven Communication | Jan 12, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| A New Tensioning Method using Deep Reinforcement Learning for Surgical Pattern Cutting | Jan 10, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Uncertainty-Based Out-of-Distribution Detection in Deep Reinforcement Learning | Jan 8, 2019 | Bayesian InferenceDeep Reinforcement Learning | —Unverified | 0 |
| A* Tree Search for Portfolio Management | Jan 7, 2019 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Self-Learning Exploration and Mapping for Mobile Robots via Deep Reinforcement Learning | Jan 6, 2019 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 0 |
| What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning | Jan 6, 2019 | Deep Reinforcement LearningQuestion Answering | —Unverified | 0 |
| Recurrent Control Nets for Deep Reinforcement Learning | Jan 6, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Exploring applications of deep reinforcement learning for real-world autonomous driving systems | Jan 6, 2019 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Imbalanced Classification | Jan 5, 2019 | ClassificationDecision Making | CodeCode Available | 0 |
| Human-Like Autonomous Car-Following Model with Deep Reinforcement Learning | Jan 3, 2019 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Complementary reinforcement learning towards explainable agents | Jan 1, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| A Theoretical Analysis of Deep Q-Learning | Jan 1, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications | Dec 31, 2018 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Learning to Design RNA | Dec 31, 2018 | CPUDeep Reinforcement Learning | CodeCode Available | 0 |
| Learn to Interpret Atari Agents | Dec 29, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Dynamic Planning Networks | Dec 28, 2018 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Dealing with Limited Backhaul Capacity in Millimeter Wave Systems: A Deep Reinforcement Learning Approach | Dec 27, 2018 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Quantum Adiabatic Algorithm Design using Reinforcement Learning | Dec 27, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning to Walk via Deep Reinforcement Learning | Dec 26, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A New Concept of Deep Reinforcement Learning based Augmented General Sequence Tagging System | Dec 26, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Iroko: A Framework to Prototype Reinforcement Learning for Data Center Traffic Control | Dec 24, 2018 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 |
| Parallelized Interactive Machine Learning on Autonomous Vehicles | Dec 23, 2018 | Autonomous VehiclesBIG-bench Machine Learning | —Unverified | 0 |
| NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning | Dec 21, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning | Dec 21, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Learning to Navigate the Web | Dec 21, 2018 | Deep Reinforcement LearningInstruction Following | —Unverified | 0 |
| Domain Adaptation for Reinforcement Learning on the Atari | Dec 18, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep reinforcement learning for search, recommendation, and online advertising: a survey | Dec 18, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Information-Directed Exploration for Deep Reinforcement Learning | Dec 18, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents | Dec 17, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Decentralized Computation Offloading for Multi-User Mobile Edge Computing: A Deep Reinforcement Learning Approach | Dec 16, 2018 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 0 |