| The StarCraft Multi-Agent Challenge | Feb 11, 2019 | BenchmarkingMuJoCo | CodeCode Available | 1 |
| Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy Reinforcement Learning | Jan 31, 2019 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Lyapunov-based Safe Policy Optimization for Continuous Control | Jan 28, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Action Robust Reinforcement Learning and Applications in Continuous Control | Jan 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| WALL-E: An Efficient Reinforcement Learning Research Framework | Jan 18, 2019 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| Meta Reinforcement Learning with Distribution of Exploration Parameters Learned by Evolution Strategies | Dec 29, 2018 | Meta-LearningMeta Reinforcement Learning | —Unverified | 0 |
| NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning | Dec 21, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| A Logarithmic Barrier Method For Proximal Policy Optimization | Dec 16, 2018 | MuJoCoReinforcement Learning | —Unverified | 0 |
| Residual Policy Learning | Dec 15, 2018 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Generative Adversarial Self-Imitation Learning | Dec 3, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| Simple random search of static linear policies is competitive for reinforcement learning | Dec 1, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| BlockPuzzle - A Challenge in Physical Reasoning and Generalization for Robot Learning | Nov 30, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement Learning | Nov 22, 2018 | Hierarchical Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Episodic Curiosity through Reachability | Oct 4, 2018 | MuJoCoReinforcement Learning | CodeCode Available | 0 |
| Understanding the Asymptotic Performance of Model-Based RL Methods | Sep 27, 2018 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Improving On-policy Learning with Statistical Reward Accumulation | Sep 7, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Risk-Sensitive Generative Adversarial Imitation Learning | Aug 13, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| ToriLLE: Learning Environment for Hand-to-Hand Combat | Jul 26, 2018 | BIG-bench Machine LearningMuJoCo | CodeCode Available | 0 |
| EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning | Jul 22, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| Variance Reduction for Reinforcement Learning in Input-Driven Environments | Jul 6, 2018 | Meta-LearningMuJoCo | —Unverified | 0 |
| Self-Imitation Learning | Jun 14, 2018 | Atari GamesImitation Learning | CodeCode Available | 0 |
| Supervised Policy Update for Deep Reinforcement Learning | May 29, 2018 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Policy Optimization with Second-Order Advantage Information | May 9, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari GamesDecision Making | CodeCode Available | 0 |