| Language as an Abstraction for Hierarchical Deep Reinforcement Learning | Jun 18, 2019 | Deep Reinforcement Learning, Instruction Following | Code Available | 0 |
| Learning Powerful Policies by Using Consistent Dynamics Model | Jun 11, 2019 | Atari Games, model | Code Available | 0 |
| Self-Supervised Exploration via Disagreement | Jun 10, 2019 | Active Learning, Efficient Exploration | Code Available | 1 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning | May 31, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Learning Efficient and Effective Exploration Policies with Counterfactual Meta Policy | May 28, 2019 | counterfactual, Efficient Exploration | Unverified | 0 |
| Policy Search by Target Distribution Learning for Continuous Control | May 27, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards | May 27, 2019 | Imitation Learning, MuJoCo | Code Available | 1 |
| Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction | May 27, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Imitation Learning from Video by Leveraging Proprioception | May 22, 2019 | Imitation Learning, MuJoCo | Unverified | 0 |
| Evolving Rewards to Automate Reinforcement Learning | May 18, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Leveraging exploration in off-policy algorithms via normalizing flows | May 16, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| P3O: Policy-on Policy-off Policy Optimization | May 5, 2019 | MuJoCo, Reinforcement Learning | Code Available | 0 |
| Collaborative Evolutionary Reinforcement Learning | May 2, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies | May 1, 2019 | MuJoCo | Unverified | 0 |
| SUPERVISED POLICY UPDATE | May 1, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Composing Complex Skills by Learning Transition Policies with Proximity Reward Induction | May 1, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation | Apr 14, 2019 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations | Apr 12, 2019 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 |
| Generalized Off-Policy Actor-Critic | Mar 27, 2019 | counterfactual, MuJoCo | Unverified | 0 |
| Towards Characterizing Divergence in Deep Q-Learning | Mar 21, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Provably Robust Blackbox Optimization for Reinforcement Learning | Mar 7, 2019 | MuJoCo, reinforcement-learning | Unverified | 0 |
| α-Rank: Multi-Agent Evaluation by Evolution | Mar 4, 2019 | Mathematical Proofs, MuJoCo | Unverified | 0 |
| Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments | Mar 3, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| Learn a Prior for RHEA for Better Online Planning | Feb 14, 2019 | Evolutionary Algorithms, MuJoCo | Unverified | 0 |
| The StarCraft Multi-Agent Challenge | Feb 11, 2019 | Benchmarking, MuJoCo | Code Available | 1 |
| Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy Reinforcement Learning | Jan 31, 2019 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Lyapunov-based Safe Policy Optimization for Continuous Control | Jan 28, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| Action Robust Reinforcement Learning and Applications in Continuous Control | Jan 26, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| WALL-E: An Efficient Reinforcement Learning Research Framework | Jan 18, 2019 | MuJoCo, reinforcement-learning | Code Available | 0 |
| Meta Reinforcement Learning with Distribution of Exploration Parameters Learned by Evolution Strategies | Dec 29, 2018 | Meta-Learning, Meta Reinforcement Learning | Unverified | 0 |
| NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning | Dec 21, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| A Logarithmic Barrier Method For Proximal Policy Optimization | Dec 16, 2018 | MuJoCo, Reinforcement Learning | Unverified | 0 |
| Residual Policy Learning | Dec 15, 2018 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Generative Adversarial Self-Imitation Learning | Dec 3, 2018 | Imitation Learning, MuJoCo | Unverified | 0 |
| Simple random search of static linear policies is competitive for reinforcement learning | Dec 1, 2018 | continuous-control, Continuous Control | Code Available | 0 |
| BlockPuzzle - A Challenge in Physical Reasoning and Generalization for Robot Learning | Nov 30, 2018 | Imitation Learning, MuJoCo | Unverified | 0 |
| Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement Learning | Nov 22, 2018 | Hierarchical Reinforcement Learning, MuJoCo | Code Available | 0 |
| Episodic Curiosity through Reachability | Oct 4, 2018 | MuJoCo, Reinforcement Learning | Code Available | 0 |
| Understanding the Asymptotic Performance of Model-Based RL Methods | Sep 27, 2018 | Model-based Reinforcement Learning, MuJoCo | Unverified | 0 |
| Improving On-policy Learning with Statistical Reward Accumulation | Sep 7, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Risk-Sensitive Generative Adversarial Imitation Learning | Aug 13, 2018 | Imitation Learning, MuJoCo | Unverified | 0 |
| ToriLLE: Learning Environment for Hand-to-Hand Combat | Jul 26, 2018 | BIG-bench Machine Learning, MuJoCo | Code Available | 0 |
| EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning | Jul 22, 2018 | Imitation Learning, MuJoCo | Unverified | 0 |
| Variance Reduction for Reinforcement Learning in Input-Driven Environments | Jul 6, 2018 | Meta-Learning, MuJoCo | Unverified | 0 |
| Self-Imitation Learning | Jun 14, 2018 | Atari Games, Imitation Learning | Code Available | 0 |
| Supervised Policy Update for Deep Reinforcement Learning | May 29, 2018 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Policy Optimization with Second-Order Advantage Information | May 9, 2018 | continuous-control, Continuous Control | Code Available | 0 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari Games, Decision Making | Code Available | 0 |