
Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Demonstrations are usually presented as state-action trajectories, where each pair indicates the action to take in the state being visited. The demonstrated actions are typically used in one of two ways to learn the behavior policy. The first, known as Behavior Cloning (BC), treats the action as the target label for each state and learns a generalized mapping from states to actions in a supervised manner. The second, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions and aims to find a reward/cost function under which the demonstrated decisions are optimal.
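As a rough illustration of the Behavior Cloning idea described above, the sketch below treats each demonstrated action as a label and uses a 1-nearest-neighbor rule as the supervised learner. The demonstration data, the `clone_policy` helper, and the choice of nearest-neighbor classification are all assumptions made for this example; practical BC systems usually fit a neural network instead.

```python
from math import dist

# Hypothetical expert demonstrations: (state, action) pairs.
# States are 2-D positions; actions are discrete labels.
demonstrations = [
    ((0.0, 0.0), "right"),
    ((1.0, 0.0), "right"),
    ((2.0, 0.0), "up"),
    ((2.0, 1.0), "up"),
]

def clone_policy(demos):
    """Return a policy that copies the expert action of the nearest demonstrated state."""
    def policy(state):
        # Supervised step: the demonstrated action is the label of its state,
        # and an unseen state receives the label of its closest neighbor.
        _, action = min(demos, key=lambda pair: dist(pair[0], state))
        return action
    return policy

policy = clone_policy(demonstrations)
print(policy((0.4, 0.1)))  # state not present in the demonstrations
print(policy((2.1, 0.6)))
```

The learned policy generalizes only as far as the demonstrations cover the state space, which is the usual limitation of a purely supervised treatment.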

Finally, a newer methodology, Inverse Q-Learning, aims to learn Q-functions directly from expert data. The Q-function implicitly represents the rewards, and the corresponding optimal policy can be written as a Boltzmann distribution, similar to soft Q-learning.

Source: Learning to Imitate
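The Boltzmann form of the policy mentioned in the description can be sketched as a softmax over the Q-values of a single state, pi(a|s) proportional to exp(Q(s,a) / temperature). The function name `boltzmann_policy`, the `temperature` parameter, and the example Q-values are assumptions made for illustration and are not taken from any particular Inverse Q-Learning implementation.

```python
import math

def boltzmann_policy(q_values, temperature=1.0):
    """Map the Q-values of one state to action probabilities via a softmax."""
    scaled = [q / temperature for q in q_values]
    shift = max(scaled)  # subtract the maximum for numerical stability
    weights = [math.exp(v - shift) for v in scaled]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical Q-values for three actions in a single state.
probs = boltzmann_policy([1.0, 2.0, 3.0], temperature=1.0)
print(probs)
```

Lower temperatures concentrate probability on the highest-valued action, while higher temperatures move the policy toward uniform.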

Title	Date	Tasks	Status
EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning	Jul 22, 2018	Imitation Learning, MuJoCo	Unverified
Generative Adversarial Imitation from Observation	Jul 17, 2018	Imitation Learning	Code Available
Bipedal Walking Robot using Deep Deterministic Policy Gradient	Jul 16, 2018	BIG-bench Machine Learning, Decision Making	Code Available
Extracting Contact and Motion from Manipulation Videos	Jul 13, 2018	Clustering, Imitation Learning	Unverified
CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving	Jul 10, 2018	Imitation Learning, reinforcement-learning	Unverified
Universal Planning Networks: Learning Generalizable Representations for Visuomotor Control	Jul 1, 2018	Imitation Learning, Reinforcement Learning	Code Available
Learning How to Actively Learn: A Deep Imitation Learning Approach	Jul 1, 2018	Active Learning, General Classification	Code Available
End-to-End Deep Imitation Learning: Robot Soccer Case Study	Jun 28, 2018	Imitation Learning	Unverified
The Virtuous Machine - Old Ethics for New Technology?	Jun 27, 2018	Autonomous Driving, Ethics	Unverified
Learning Existing Social Conventions via Observationally Augmented Self-Play	Jun 26, 2018	Imitation Learning, Multi-agent Reinforcement Learning	Unverified
