| Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation | Oct 11, 2019 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression and Challenge | Oct 11, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning | Oct 11, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 |
| Efficient Intrinsically Motivated Robotic Grasping with Learning-Adaptive Imagination in Latent Space | Oct 10, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Hierarchical Deep Double Q-Routing | Oct 9, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning | Oct 9, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Defensive Escort Teams via Multi-Agent Deep Reinforcement Learning | Oct 9, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals | Oct 8, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Policies Modulating Trajectory Generators | Oct 7, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| DeepMNavigate: Deep Reinforced Multi-Robot Navigation Unifying Local & Global Collision Avoidance | Oct 4, 2019 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| Deep Q-Network for Angry Birds | Oct 4, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning | Oct 2, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| QuaRL: Quantization for Fast and Environmentally Sustainable Reinforcement Learning | Oct 2, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots | Oct 2, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification | Oct 1, 2019 | Active LearningDeep Reinforcement Learning | —Unverified | 0 |
| End-to-End Motion Planning of Quadrotors Using Deep Reinforcement Learning | Sep 30, 2019 | Deep Reinforcement LearningMotion Planning | —Unverified | 0 |
| Dynamic Interaction-Aware Scene Understanding for Reinforcement Learning in Autonomous Driving | Sep 30, 2019 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Tensor-based Cooperative Control for Large Scale Multi-intersection Traffic Signal Using Deep Reinforcement Learning and Imitation Learning | Sep 30, 2019 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Relational Graph Learning for Crowd Navigation | Sep 28, 2019 | Deep Reinforcement LearningGraph Learning | CodeCode Available | 0 |
| How to Evaluate Machine Learning Approaches for Combinatorial Optimization: Application to the Travelling Salesman Problem | Sep 28, 2019 | BIG-bench Machine LearningCombinatorial Optimization | CodeCode Available | 0 |
| Deep Reinforcement Learning Based Power control for Wireless Multicast Systems | Sep 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning | Sep 27, 2019 | CPUDeep Reinforcement Learning | —Unverified | 0 |
| Counterfactual States for Atari Agents via Generative Deep Learning | Sep 27, 2019 | counterfactualDecision Making | —Unverified | 0 |
| Harnessing Structures for Value-Based Planning and Reinforcement Learning | Sep 26, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control | Sep 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Evo-NAS: Evolutionary-Neural Hybrid Agent for Architecture Search | Sep 25, 2019 | Deep Reinforcement LearningEvolutionary Algorithms | —Unverified | 0 |
| Assessing Generalization in TD methods for Deep Reinforcement Learning | Sep 25, 2019 | Deep Reinforcement LearningMemorization | —Unverified | 0 |
| Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients | Sep 25, 2019 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Zero-Shot Policy Transfer with Disentangled Attention | Sep 25, 2019 | Deep Reinforcement LearningDomain Adaptation | —Unverified | 0 |
| Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| HIPPOCAMPAL NEURONAL REPRESENTATIONS IN CONTINUAL LEARNING | Sep 25, 2019 | Continual LearningDeep Reinforcement Learning | —Unverified | 0 |
| Learning Semantically Meaningful Representations Through Embodiment | Sep 25, 2019 | Deep Reinforcement Learning | —Unverified | 0 |
| Improving Exploration of Deep Reinforcement Learning using Planning for Policy Search | Sep 25, 2019 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Learning Latent Representations for Inverse Dynamics using Generalized Experiences | Sep 25, 2019 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Long-term planning, short-term adjustments | Sep 25, 2019 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| D3PG: Deep Differentiable Deterministic Policy Gradients | Sep 25, 2019 | Deep Reinforcement LearningModel Predictive Control | —Unverified | 0 |
| Learning Key Steps to Attack Deep Reinforcement Learning Agents | Sep 25, 2019 | Adversarial AttackAtari Games | —Unverified | 0 |
| Do recent advancements in model-based deep reinforcement learning really improve data efficiency? | Sep 25, 2019 | Atari Games 100kDeep Reinforcement Learning | —Unverified | 0 |
| C-3PO: Cyclic-Three-Phase Optimization for Human-Robot Motion Retargeting based on Reinforcement Learning | Sep 25, 2019 | Deep Reinforcement Learningmotion retargeting | CodeCode Available | 0 |
| Learning to Seek: Autonomous Source Seeking with Deep Reinforcement Learning Onboard a Nano Drone Microcontroller | Sep 25, 2019 | Autonomous NavigationDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Auto-Deferring Policy for Combinatorial Optimization | Sep 25, 2019 | Combinatorial OptimizationComputational Efficiency | —Unverified | 0 |
| Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| How many weights are enough : can tensor factorization learn efficient policies ? | Sep 25, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Striving for Simplicity in Off-Policy Deep Reinforcement Learning | Sep 25, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| ROS-HPL: Robotic Object Search with Hierarchical Policy Learning and Intrinsic-Extrinsic Modeling | Sep 25, 2019 | Deep Reinforcement LearningObject | —Unverified | 0 |
| MoET: Interpretable and Verifiable Reinforcement Learning via Mixture of Expert Trees | Sep 25, 2019 | Deep Reinforcement LearningGame of Go | —Unverified | 0 |
| Controlling an Autonomous Vehicle with Deep Reinforcement Learning | Sep 24, 2019 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning | Sep 24, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 |
| Power Allocation in Cache-Aided NOMA Systems: Optimization and Deep Reinforcement Learning Approaches | Sep 24, 2019 | Deep Reinforcement LearningFairness | —Unverified | 0 |