| On the Expressivity of Neural Networks for Deep Reinforcement Learning | Oct 14, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Primal Wasserstein Imitation Learning | Jun 8, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout | Jan 26, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning | Apr 19, 2021 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 |
| BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization | Aug 1, 2023 | Bilevel OptimizationDiversity | CodeCode Available | 0 |
| Proximal Policy Distillation | Jul 21, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari | Feb 24, 2018 | Atari GamesBenchmarking | CodeCode Available | 0 |
| Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic | Nov 7, 2016 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods | Nov 6, 2021 | MuJoCoPolicy Gradient Methods | CodeCode Available | 0 |