| Language as an Abstraction for Hierarchical Deep Reinforcement Learning | Jun 18, 2019 | Deep Reinforcement LearningInstruction Following | CodeCode Available | 0 |
| Learning Powerful Policies by Using Consistent Dynamics Model | Jun 11, 2019 | Atari Gamesmodel | CodeCode Available | 0 |
| Self-Supervised Exploration via Disagreement | Jun 10, 2019 | Active LearningEfficient Exploration | CodeCode Available | 1 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning | May 31, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Learning Efficient and Effective Exploration Policies with Counterfactual Meta Policy | May 28, 2019 | counterfactualEfficient Exploration | —Unverified | 0 |
| Policy Search by Target Distribution Learning for Continuous Control | May 27, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards | May 27, 2019 | Imitation LearningMuJoCo | CodeCode Available | 1 |
| Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction | May 27, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Imitation Learning from Video by Leveraging Proprioception | May 22, 2019 | Imitation LearningMuJoCo | —Unverified | 0 |
| Evolving Rewards to Automate Reinforcement Learning | May 18, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Leveraging exploration in off-policy algorithms via normalizing flows | May 16, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| P3O: Policy-on Policy-off Policy Optimization | May 5, 2019 | MuJoCoReinforcement Learning | CodeCode Available | 0 |
| Collaborative Evolutionary Reinforcement Learning | May 2, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies | May 1, 2019 | MuJoCo | —Unverified | 0 |
| SUPERVISED POLICY UPDATE | May 1, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Composing Complex Skills by Learning Transition Policies with Proximity Reward Induction | May 1, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation | Apr 14, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations | Apr 12, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Generalized Off-Policy Actor-Critic | Mar 27, 2019 | counterfactualMuJoCo | —Unverified | 0 |
| Towards Characterizing Divergence in Deep Q-Learning | Mar 21, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Provably Robust Blackbox Optimization for Reinforcement Learning | Mar 7, 2019 | MuJoCoreinforcement-learning | —Unverified | 0 |
| α-Rank: Multi-Agent Evaluation by Evolution | Mar 4, 2019 | Mathematical ProofsMuJoCo | —Unverified | 0 |
| Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments | Mar 3, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Learn a Prior for RHEA for Better Online Planning | Feb 14, 2019 | Evolutionary AlgorithmsMuJoCo | —Unverified | 0 |