| FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning | Oct 4, 2020 | GPUMuJoCo | CodeCode Available | 1 |
| Revisiting Design Choices in Proximal Policy Optimization | Sep 23, 2020 | MuJoCo | CodeCode Available | 1 |
| Sample-Efficient Automated Deep Reinforcement Learning | Sep 3, 2020 | Deep Reinforcement LearningHyperparameter Optimization | CodeCode Available | 1 |
| Imitation Learning with Sinkhorn Distances | Aug 20, 2020 | Imitation LearningMuJoCo | CodeCode Available | 1 |
| Contrastive Variational Reinforcement Learning for Complex Observations | Aug 6, 2020 | Atari GamesContinuous Control | CodeCode Available | 1 |
| Robust Deep Reinforcement Learning through Adversarial Loss | Aug 5, 2020 | Adversarial AttackAtari Games | CodeCode Available | 1 |
| Nengo and low-power AI hardware for robust, embedded neurorobotics | Jul 20, 2020 | MuJoCo | CodeCode Available | 1 |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Fast Adaptation via Policy-Dynamics Value Functions | Jul 6, 2020 | MuJoCo | CodeCode Available | 1 |
| Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient | Jul 3, 2020 | BenchmarkingMuJoCo | CodeCode Available | 1 |