| On the Expressivity of Neural Networks for Deep Reinforcement Learning | Oct 14, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards | Oct 10, 2019 | Hierarchical Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Multi-step Greedy Reinforcement Learning Algorithms | Oct 7, 2019 | Continuous ControlGame of Go | —Unverified | 0 |
| Learning Calibratable Policies using Programmatic Style-Consistency | Oct 2, 2019 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| Formal Language Constraints for Markov Decision Processes | Oct 2, 2019 | Atari GamesMuJoCo | CodeCode Available | 0 |
| Learning from Observations Using a Single Video Demonstration and Human Feedback | Sep 29, 2019 | MuJoCo | —Unverified | 0 |
| A Generalized Training Approach for Multiagent Learning | Sep 27, 2019 | MuJoCo | —Unverified | 0 |
| Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation | Sep 26, 2019 | MuJoCoMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients | Sep 25, 2019 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Safe Policy Learning for Continuous Control | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Regulatory Focus: Promotion and Prevention Inclinations in Policy Search | Sep 25, 2019 | Atari Gamescontinuous-control | —Unverified | 0 |
| Multi-task Batch Reinforcement Learning with Metric Learning | Sep 25, 2019 | Meta Reinforcement LearningMetric Learning | —Unverified | 0 |
| Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Bootstrapping the Expressivity with Model-based Planning | Sep 25, 2019 | modelMuJoCo | CodeCode Available | 0 |
| Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning Latent Representations for Inverse Dynamics using Generalized Experiences | Sep 25, 2019 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Deep exploration by novelty-pursuit with maximum state entropy | Sep 25, 2019 | Efficient ExplorationMuJoCo | —Unverified | 0 |
| CrossNorm: On Normalization for Off-Policy Reinforcement Learning | Sep 25, 2019 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | Sep 25, 2019 | Decision MakingKnowledge Distillation | —Unverified | 0 |
| Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning | Sep 17, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning | Sep 17, 2019 | MuJoCoOpenAI Gym | CodeCode Available | 0 |
| Biased Estimates of Advantages over Path Ensembles | Sep 15, 2019 | Atari Gamescontinuous-control | —Unverified | 0 |
| Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space | Sep 15, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning | Sep 11, 2019 | MuJoCoQ-Learning | —Unverified | 0 |