Generative Adversarial Imitation Learning

2016-06-10 · NeurIPS 2016

Jonathan Ho, Stefano Ermon

Code

github.com/twni2016/f-IRL · pytorch · ★ 45
github.com/ran-weii/cleanil · pytorch · ★ 24
github.com/emunaran/stochastic-human-driving-policies-drl · pytorch · ★ 16
github.com/KAIST-AILab/deeprl_practice_colab · none · ★ 8
github.com/Techget/gail-tf-sc2 · tf · ★ 7
github.com/rohitrango/Reward-bias-in-GAIL · tf · ★ 4
github.com/170928/-Review-Generative-Adversarial-Imitation-Learning · tf · ★ 0
github.com/Khrylx/PyTorch-RL · pytorch · ★ 0
github.com/sisl/ngsim_env · tf · ★ 0
github.com/morikatron/GAIL_PPO · tf · ★ 0

Abstract

Consider learning a policy from example expert behavior, without interaction with the expert or access to reinforcement signal. One approach is to recover the expert's cost function with inverse reinforcement learning, then extract a policy from that cost function with reinforcement learning. This approach is indirect and can be slow. We propose a new general framework for directly extracting a policy from data, as if it were obtained by reinforcement learning following inverse reinforcement learning. We show that a certain instantiation of our framework draws an analogy between imitation learning and generative adversarial networks, from which we derive a model-free imitation learning algorithm that obtains significant performance gains over existing model-free methods in imitating complex behaviors in large, high-dimensional environments.
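
The adversarial analogy in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the function names, the linear logistic discriminator, the one-dimensional features, and the synthetic data are all assumptions made for illustration. A discriminator D is trained to output high values on state-action features sampled from the current policy and low values on expert features, and log D(s, a) then serves as the cost the policy would minimize. The policy update itself (a trust-region policy optimization step in the paper) is omitted.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def discriminator(w, b, x):
    """D(x): estimated probability that feature vector x came from the policy."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def train_discriminator(policy_x, expert_x, steps=200, lr=0.1):
    """Gradient ascent on mean log D(policy) + mean log(1 - D(expert))."""
    dim = len(policy_x[0])
    w, b = [0.0] * dim, 0.0
    for _ in range(steps):
        gw, gb = [0.0] * dim, 0.0
        for x in policy_x:
            # Derivative of log D with respect to the logit is (1 - D).
            g = (1.0 - discriminator(w, b, x)) / len(policy_x)
            gw = [gwi + g * xi for gwi, xi in zip(gw, x)]
            gb += g
        for x in expert_x:
            # Derivative of log(1 - D) with respect to the logit is -D.
            g = -discriminator(w, b, x) / len(expert_x)
            gw = [gwi + g * xi for gwi, xi in zip(gw, x)]
            gb += g
        w = [wi + lr * gwi for wi, gwi in zip(w, gw)]
        b += lr * gb
    return w, b


def imitation_cost(w, b, x):
    """Cost handed to the policy optimizer: log D(x). Lower means more expert-like."""
    return math.log(discriminator(w, b, x))


# Synthetic 1-D features (assumed): expert samples near +1, policy samples near -1.
random.seed(0)
expert_x = [[random.gauss(1.0, 0.5)] for _ in range(50)]
policy_x = [[random.gauss(-1.0, 0.5)] for _ in range(50)]

w, b = train_discriminator(policy_x, expert_x)
cost_expert_like = imitation_cost(w, b, [1.0])
cost_policy_like = imitation_cost(w, b, [-1.0])
```

In this sketch, a policy that lowers its cost is pushed toward regions the discriminator cannot tell apart from expert data. In a full training loop the discriminator and policy updates would alternate.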

Tasks

Imitation Learning, Reinforcement Learning (RL)
