| A Game-Theoretic Approach to Multi-Agent Trust Region Optimization | Jun 12, 2021 | Atari Games, MuJoCo | Code Available | 1 |
| Keyframe-Focused Visual Imitation Learning | Jun 11, 2021 | Continuous Control | Unverified | 0 |
| Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL | Jun 9, 2021 | MuJoCo, Reinforcement Learning (RL) | Code Available | 1 |
| Average-Reward Reinforcement Learning with Trust Region Methods | Jun 7, 2021 | Continuous Control | Unverified | 0 |
| DisTop: Discovering a Topological representation to learn diverse and rewarding skills | Jun 6, 2021 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified | 0 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage | Jun 6, 2021 | Continuous Control | Code Available | 1 |
| SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching | Jun 6, 2021 | Imitation Learning, MuJoCo | Unverified | 0 |
| Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture | May 28, 2021 | Meta Reinforcement Learning, MuJoCo | Code Available | 1 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage | May 21, 2021 | Continuous Control | Code Available | 1 |
| Regret Minimization Experience Replay in Off-Policy Reinforcement Learning | May 15, 2021 | MuJoCo, reinforcement-learning | Code Available | 0 |