| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Jan 9, 2020 | Continuous Control | Code Available | 1 |
| Universal Successor Features for Transfer Reinforcement Learning | Jan 5, 2020 | MuJoCo, reinforcement-learning | —Unverified | 0 |
| Fast Adaptation to New Environments via Policy-Dynamics Value Functions | Jan 1, 2020 | MuJoCo | —Unverified | 0 |
| Inferring DQN structure for high-dimensional continuous control | Jan 1, 2020 | Continuous Control | —Unverified | 0 |
| Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning | Dec 13, 2019 | Continuous Control | —Unverified | 0 |
| Parareal with a Learned Coarse Model for Robotic Manipulation | Dec 12, 2019 | MuJoCo | —Unverified | 0 |
| Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online | Nov 19, 2019 | Continual Learning, continuous-control | —Unverified | 0 |
| MANGA: Method Agnostic Neural-policy Generalization and Adaptation | Nov 19, 2019 | Imitation Learning, MuJoCo | —Unverified | 0 |
| Gradientless Descent: High-Dimensional Zeroth-Order Optimization | Nov 14, 2019 | MuJoCo, Vocal Bursts Intensity Prediction | —Unverified | 0 |
| Multi-Path Policy Optimization | Nov 11, 2019 | Deep Reinforcement Learning, Efficient Exploration | —Unverified | 0 |