| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Jan 9, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| Universal Successor Features for Transfer Reinforcement Learning | Jan 5, 2020 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Fast Adaptation to New Environments via Policy-Dynamics Value Functions | Jan 1, 2020 | MuJoCo | Unverified | 0 |
| Inferring DQN structure for high-dimensional continuous control | Jan 1, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning | Dec 13, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Parareal with a Learned Coarse Model for Robotic Manipulation | Dec 12, 2019 | MuJoCo | Unverified | 0 |
| Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online | Nov 19, 2019 | Continual Learning, continuous-control | Unverified | 0 |
| MANGA: Method Agnostic Neural-policy Generalization and Adaptation | Nov 19, 2019 | Imitation Learning, MuJoCo | Unverified | 0 |
| Gradientless Descent: High-Dimensional Zeroth-Order Optimization | Nov 14, 2019 | MuJoCo, Vocal Bursts Intensity Prediction | Unverified | 0 |
| Multi-Path Policy Optimization | Nov 11, 2019 | Deep Reinforcement Learning, Efficient Exploration | Unverified | 0 |
| Asynchronous Methods for Model-Based Reinforcement Learning | Oct 28, 2019 | model, Model-based Reinforcement Learning | Code Available | 0 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 |
| Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales | Oct 23, 2019 | MuJoCo, Variational Inference | Code Available | 0 |
| VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning | Oct 18, 2019 | Meta-Learning, MuJoCo | Code Available | 1 |
| On the Expressivity of Neural Networks for Deep Reinforcement Learning | Oct 14, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards | Oct 10, 2019 | Hierarchical Reinforcement Learning, MuJoCo | Code Available | 0 |
| Multi-step Greedy Reinforcement Learning Algorithms | Oct 7, 2019 | Continuous Control, Game of Go | Unverified | 0 |
| Learning Calibratable Policies using Programmatic Style-Consistency | Oct 2, 2019 | Imitation Learning, MuJoCo | Code Available | 0 |
| Formal Language Constraints for Markov Decision Processes | Oct 2, 2019 | Atari Games, MuJoCo | Code Available | 0 |
| Improving Sample Efficiency in Model-Free Reinforcement Learning from Images | Oct 2, 2019 | Image Reconstruction, MuJoCo | Code Available | 1 |
| Learning from Observations Using a Single Video Demonstration and Human Feedback | Sep 29, 2019 | MuJoCo | Unverified | 0 |
| A Generalized Training Approach for Multiagent Learning | Sep 27, 2019 | MuJoCo | Unverified | 0 |
| Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation | Sep 26, 2019 | MuJoCo, Multi-Objective Reinforcement Learning | Unverified | 0 |
| Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | Sep 25, 2019 | Decision Making, Knowledge Distillation | Unverified | 0 |
| Deep exploration by novelty-pursuit with maximum state entropy | Sep 25, 2019 | Efficient Exploration, MuJoCo | Unverified | 0 |