| Improving On-policy Learning with Statistical Reward Accumulation | Sep 7, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Risk-Sensitive Generative Adversarial Imitation Learning | Aug 13, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| ToriLLE: Learning Environment for Hand-to-Hand Combat | Jul 26, 2018 | BIG-bench Machine LearningMuJoCo | CodeCode Available | 0 |
| EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning | Jul 22, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| Variance Reduction for Reinforcement Learning in Input-Driven Environments | Jul 6, 2018 | Meta-LearningMuJoCo | —Unverified | 0 |
| Self-Imitation Learning | Jun 14, 2018 | Atari GamesImitation Learning | CodeCode Available | 0 |
| Supervised Policy Update for Deep Reinforcement Learning | May 29, 2018 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Policy Optimization with Second-Order Advantage Information | May 9, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari GamesDecision Making | CodeCode Available | 0 |