SOTAVerified

Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Demonstrations are usually presented as state-action trajectories, where each pair indicates the action to take in the state being visited. The demonstrated actions are typically used in one of two ways to learn the policy. The first, known as Behavior Cloning (BC), treats each action as the target label for its state and learns a generalized mapping from states to actions in a supervised manner. The second, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions and aims to find a reward/cost function under which the demonstrated decisions are optimal.
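As a rough illustration of the Behavior Cloning idea, the sketch below fits a policy to demonstrated state-action pairs by ordinary supervised regression. Everything specific here is an assumption made for the example and not taken from the source: the linear policy class, the synthetic expert `W_expert`, the noise level, and the helper names `behavior_cloning_fit` and `policy`.

```python
import numpy as np

def behavior_cloning_fit(states, actions):
    """Fit a linear policy a = s @ W by least squares on (state, action) pairs.

    Each demonstrated action is treated as the regression target for its state.
    """
    W, *_ = np.linalg.lstsq(states, actions, rcond=None)
    return W

def policy(W, state):
    """Return the action the cloned policy takes in `state`."""
    return state @ W

# Hypothetical expert: maps a 3-dimensional state to a 2-dimensional action.
rng = np.random.default_rng(0)
W_expert = np.array([[1.0, -0.5],
                     [0.2, 0.8],
                     [-0.3, 0.1]])

# Synthetic demonstrations: states visited and the (slightly noisy) expert actions.
states = rng.normal(size=(200, 3))
actions = states @ W_expert + 0.01 * rng.normal(size=(200, 2))

W_bc = behavior_cloning_fit(states, actions)
new_state = np.array([0.5, -1.0, 2.0])
print("cloned action:", policy(W_bc, new_state))
print("expert action:", new_state @ W_expert)
```

In practice the linear map would usually be replaced by a neural network and the least-squares fit by gradient descent on a regression or classification loss, but the structure stays the same: states are inputs and demonstrated actions are labels.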

Finally, a newer methodology, Inverse Q-Learning, aims to learn Q-functions directly from expert data. The Q-function implicitly represents the reward, and the optimal policy under it is given by a Boltzmann distribution, as in soft Q-learning.
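For reference, the relationships alluded to here can be written out in the standard maximum-entropy (soft Q-learning) form for a discrete action space. The temperature \alpha, the discount factor \gamma, and the transition kernel P are notation assumed for this sketch and do not appear in the source.

```latex
\pi(a \mid s) = \frac{\exp\big(Q(s,a)/\alpha\big)}{\sum_{a'} \exp\big(Q(s,a')/\alpha\big)},
\qquad
V(s) = \alpha \log \sum_{a'} \exp\big(Q(s,a')/\alpha\big),
\qquad
r(s,a) = Q(s,a) - \gamma\, \mathbb{E}_{s' \sim P(\cdot \mid s,a)}\big[V(s')\big]
```

The first expression is the Boltzmann policy, the second is the soft value function, and the third shows how a reward is implied by a given Q-function, which is the sense in which the reward is represented implicitly.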

Source: Learning to Imitate

Title	Date	Tasks	Status	Hype	Score
CRIL: Continual Robot Imitation Learning via Generative and Prediction Model	Jun 17, 2021	Generative Adversarial Network, Imitation Learning	Code Available	1	5
Everyone Deserves A Reward: Learning Customized Human Preferences	Sep 6, 2023	Diversity, Imitation Learning	Code Available	1	5
Learning from Guided Play: Improving Exploration for Adversarial Imitation Learning with Simple Auxiliary Tasks	Dec 30, 2022	Imitation Learning	Code Available	1	5
Planning for Sample Efficient Imitation Learning	Oct 18, 2022	Imitation Learning	Code Available	1	5
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization	Jun 23, 2020	Imitation Learning, reinforcement-learning	Code Available	1	5
Learning to Extrapolate: A Transductive Approach	Apr 27, 2023	Imitation Learning	Code Available	1	5
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation	Nov 25, 2022	Continuous Control	Code Available	1	5
PP-TIL: Personalized Planning for Autonomous Driving with Instance-based Transfer Imitation Learning	Jul 26, 2024	Autonomous Driving, Imitation Learning	Code Available	1	5
On a Connection Between Imitation Learning and RLHF	Mar 7, 2025	Imitation Learning	Code Available	1	5
ZeroMimic: Distilling Robotic Manipulation Skills from Web Videos	Mar 31, 2025	Imitation Learning	Code Available	1	5
