SOTAVerified

Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Title	Date	Tasks	Status	Hype
MANGA: Method Agnostic Neural-policy Generalization and Adaptation	Nov 19, 2019	Imitation LearningMuJoCo	—Unverified	0
On Value Discrepancy of Imitation Learning	Nov 16, 2019	Imitation LearningReinforcement Learning	—Unverified	0
Motion Reasoning for Goal-Based Imitation Learning	Nov 13, 2019	Imitation LearningMotion Planning	—Unverified	0
Accelerating Training in Pommerman with Imitation and Reinforcement Learning	Nov 12, 2019	Imitation Learningreinforcement-learning	—Unverified	0
A Divergence Minimization Perspective on Imitation Learning Methods	Nov 6, 2019	Behavioural cloningcontinuous-control	CodeCode Available	1
Learning One-Shot Imitation from Humans without Humans	Nov 4, 2019	Imitation LearningMeta-Learning	CodeCode Available	0
Learning from Trajectories via Subgoal Discovery	Nov 3, 2019	Imitation LearningReinforcement Learning	CodeCode Available	0
DIVINE: A Generative Adversarial Imitation Learning Framework for Knowledge Graph Reasoning	Nov 1, 2019	Imitation LearningKnowledge Graphs	—Unverified	0
Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning	Nov 1, 2019	Imitation Learningreinforcement-learning	—Unverified	0
Positive-Unlabeled Reward Learning	Nov 1, 2019	Imitation LearningReinforcement Learning	—Unverified	0

Title

Status

Hype

MANGA: Method Agnostic Neural-policy Generalization and Adaptation

—Unverified

On Value Discrepancy of Imitation Learning

—Unverified

Motion Reasoning for Goal-Based Imitation Learning

—Unverified

Accelerating Training in Pommerman with Imitation and Reinforcement Learning

—Unverified

A Divergence Minimization Perspective on Imitation Learning Methods