| Learning Invariant Representations for Reinforcement Learning without Reconstruction | Jun 18, 2020 | Causal InferenceMuJoCo | CodeCode Available | 1 |
| Converting Biomechanical Models from OpenSim to MuJoCo | Jun 17, 2020 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration | Jun 15, 2020 | Efficient ExplorationMeta Reinforcement Learning | CodeCode Available | 1 |
| Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape Exploration | Jun 5, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Delay-Aware Model-Based Reinforcement Learning for Continuous Control | May 11, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization | Apr 29, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| State-only Imitation with Transition Dynamics Mismatch | Feb 27, 2020 | Imitation LearningMuJoCo | CodeCode Available | 1 |
| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Jan 9, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning | Oct 18, 2019 | Meta-LearningMuJoCo | CodeCode Available | 1 |