SOTAVerified

Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Demonstrations are usually presented as state-action trajectories, with each pair indicating the action to take in the state being visited. The demonstrated actions are typically used in one of two ways to learn the behavior policy. The first, known as Behavior Cloning (BC), treats the action as the target label for each state and learns a generalized mapping from states to actions in a supervised manner. The second, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions and aims to find a reward/cost function under which the demonstrated decisions are optimal.
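As a minimal sketch of the Behavior Cloning idea described above, the following fits a logistic policy to state-action pairs by plain gradient descent on the cross-entropy loss. The two-action setting, the synthetic "expert" rule, and the names `train_bc` and `policy` are illustrative assumptions, not taken from any listed paper.

```python
import math
import random


def train_bc(states, actions, lr=0.5, epochs=200):
    """Fit P(a=1 | s) = sigmoid(w . s + b) to demonstrations (supervised learning)."""
    dim = len(states[0])
    n = len(states)
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for s, a in zip(states, actions):
            z = sum(wi * si for wi, si in zip(w, s)) + b
            p = 1.0 / (1.0 + math.exp(-z))
            err = p - a  # gradient of the cross-entropy loss w.r.t. the logit
            for i in range(dim):
                grad_w[i] += err * s[i] / n
            grad_b += err / n
        w = [wi - lr * gi for wi, gi in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def policy(w, b, state):
    """Greedy action of the cloned policy."""
    z = sum(wi * si for wi, si in zip(w, state)) + b
    return 1 if z > 0 else 0


# Hypothetical expert: take action 1 when the first state feature is positive.
random.seed(0)
demo_states = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(200)]
demo_actions = [1 if s[0] > 0 else 0 for s in demo_states]

w, b = train_bc(demo_states, demo_actions)
accuracy = sum(policy(w, b, s) == a for s, a in zip(demo_states, demo_actions)) / len(demo_states)
print(f"agreement with demonstrations: {accuracy:.2f}")
```

The demonstrated action is used purely as a label here. Nothing in the loss accounts for how early mistakes change the states visited later, which is the sequential structure that the IRL view takes into account.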

Finally, a newer methodology, Inverse Q-Learning, aims to learn Q-functions directly from expert data. The learned Q-function implicitly represents the rewards, and the corresponding optimal policy is a Boltzmann distribution, as in soft Q-learning.
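To make the Boltzmann form concrete, here is a small sketch of how a policy can be read off a vector of Q-values for one state, assuming a discrete action set and a temperature parameter. The function names and the example Q-values are hypothetical. The `implied_reward` helper assumes a deterministic transition and uses the soft Bellman relation r(s, a) = Q(s, a) - gamma * V(s') to show how a reward can be recovered from Q.

```python
import math


def soft_value(q_values, temperature=1.0):
    """Soft state value V(s) = temperature * log sum_a exp(Q(s, a) / temperature)."""
    scaled = [q / temperature for q in q_values]
    m = max(scaled)  # subtract the max for numerical stability
    return temperature * (m + math.log(sum(math.exp(x - m) for x in scaled)))


def boltzmann_policy(q_values, temperature=1.0):
    """pi(a | s) = exp((Q(s, a) - V(s)) / temperature)."""
    v = soft_value(q_values, temperature)
    return [math.exp((q - v) / temperature) for q in q_values]


def implied_reward(q_sa, next_q_values, gamma=0.99, temperature=1.0):
    """Reward implied by Q under a deterministic transition (an assumption here)."""
    return q_sa - gamma * soft_value(next_q_values, temperature)


# Hypothetical Q-values for a state with three actions.
q = [1.0, 2.0, 3.0]
probs = boltzmann_policy(q)
print([round(p, 3) for p in probs])
```

Lower temperatures concentrate the distribution on the highest-valued action, while higher temperatures move it toward uniform.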

Source: Learning to Imitate

Title	Date	Tasks	Status	Hype	Score
Causal Imitation Learning under Temporally Correlated Noise	Feb 2, 2022	Econometrics, Imitation Learning	Code Available	1	5
Generalization Guarantees for Imitation Learning	Aug 5, 2020	Generalization Bounds, Imitation Learning	Code Available	1	5
EvIL: Evolution Strategies for Generalisable Imitation Learning	Jun 15, 2024	Behavioural cloning, continuous-control	Code Available	1	5
Globally Stable Neural Imitation Policies	Mar 7, 2024	Imitation Learning	Code Available	1	5
Adversarial Option-Aware Hierarchical Imitation Learning	Jun 10, 2021	Imitation Learning	Code Available	1	5
CDT: Cascading Decision Trees for Explainable Reinforcement Learning	Nov 15, 2020	Deep Reinforcement Learning, Explainable Models	Code Available	1	5
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization	Jun 23, 2020	Imitation Learning, reinforcement-learning	Code Available	1	5
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation	Nov 25, 2022	continuous-control, Continuous Control	Code Available	1	5
Exact Combinatorial Optimization with Graph Convolutional Neural Networks	Jun 4, 2019	Combinatorial Optimization, Imitation Learning	Code Available	1	5
Bootstrapped Model Predictive Control	Mar 24, 2025	continuous-control, Continuous Control	Code Available	1	5
