| Learning from Observations Using a Single Video Demonstration and Human Feedback | Sep 29, 2019 | MuJoCo | —Unverified | 0 |
| A Generalized Training Approach for Multiagent Learning | Sep 27, 2019 | MuJoCo | —Unverified | 0 |
| Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation | Sep 26, 2019 | MuJoCoMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | Sep 25, 2019 | Decision MakingKnowledge Distillation | —Unverified | 0 |
| Deep exploration by novelty-pursuit with maximum state entropy | Sep 25, 2019 | Efficient ExplorationMuJoCo | —Unverified | 0 |
| Regulatory Focus: Promotion and Prevention Inclinations in Policy Search | Sep 25, 2019 | Atari Gamescontinuous-control | —Unverified | 0 |
| Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| CrossNorm: On Normalization for Off-Policy Reinforcement Learning | Sep 25, 2019 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients | Sep 25, 2019 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Bootstrapping the Expressivity with Model-based Planning | Sep 25, 2019 | modelMuJoCo | CodeCode Available | 0 |