| QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Evo-NAS: Evolutionary-Neural Hybrid Agent for Architecture Search | Sep 25, 2019 | Deep Reinforcement LearningEvolutionary Algorithms | —Unverified | 0 |
| Assessing Generalization in TD methods for Deep Reinforcement Learning | Sep 25, 2019 | Deep Reinforcement LearningMemorization | —Unverified | 0 |
| Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients | Sep 25, 2019 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Zero-Shot Policy Transfer with Disentangled Attention | Sep 25, 2019 | Deep Reinforcement LearningDomain Adaptation | —Unverified | 0 |
| Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| HIPPOCAMPAL NEURONAL REPRESENTATIONS IN CONTINUAL LEARNING | Sep 25, 2019 | Continual LearningDeep Reinforcement Learning | —Unverified | 0 |
| Learning Semantically Meaningful Representations Through Embodiment | Sep 25, 2019 | Deep Reinforcement Learning | —Unverified | 0 |
| Improving Exploration of Deep Reinforcement Learning using Planning for Policy Search | Sep 25, 2019 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Learning Latent Representations for Inverse Dynamics using Generalized Experiences | Sep 25, 2019 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Long-term planning, short-term adjustments | Sep 25, 2019 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| D3PG: Deep Differentiable Deterministic Policy Gradients | Sep 25, 2019 | Deep Reinforcement LearningModel Predictive Control | —Unverified | 0 |
| Learning Key Steps to Attack Deep Reinforcement Learning Agents | Sep 25, 2019 | Adversarial AttackAtari Games | —Unverified | 0 |
| Do recent advancements in model-based deep reinforcement learning really improve data efficiency? | Sep 25, 2019 | Atari Games 100kDeep Reinforcement Learning | —Unverified | 0 |
| C-3PO: Cyclic-Three-Phase Optimization for Human-Robot Motion Retargeting based on Reinforcement Learning | Sep 25, 2019 | Deep Reinforcement Learningmotion retargeting | CodeCode Available | 0 |
| Learning to Seek: Autonomous Source Seeking with Deep Reinforcement Learning Onboard a Nano Drone Microcontroller | Sep 25, 2019 | Autonomous NavigationDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Auto-Deferring Policy for Combinatorial Optimization | Sep 25, 2019 | Combinatorial OptimizationComputational Efficiency | —Unverified | 0 |
| Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| How many weights are enough : can tensor factorization learn efficient policies ? | Sep 25, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Striving for Simplicity in Off-Policy Deep Reinforcement Learning | Sep 25, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| ROS-HPL: Robotic Object Search with Hierarchical Policy Learning and Intrinsic-Extrinsic Modeling | Sep 25, 2019 | Deep Reinforcement LearningObject | —Unverified | 0 |
| MoET: Interpretable and Verifiable Reinforcement Learning via Mixture of Expert Trees | Sep 25, 2019 | Deep Reinforcement LearningGame of Go | —Unverified | 0 |
| Controlling an Autonomous Vehicle with Deep Reinforcement Learning | Sep 24, 2019 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning | Sep 24, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 |
| Power Allocation in Cache-Aided NOMA Systems: Optimization and Deep Reinforcement Learning Approaches | Sep 24, 2019 | Deep Reinforcement LearningFairness | —Unverified | 0 |