SOTAVerified

Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Title	Date	Tasks	Status	Score
Active Policy Improvement from Multiple Black-box Oracles	Jun 17, 2023	Imitation LearningReinforcement Learning (RL)	CodeCode Available	5
Improving In-Context Learning with Reasoning Distillation	Apr 14, 2025	ARCData Augmentation	CodeCode Available	5
Intrinsically Motivated Open-Ended Multi-Task Learning Using Transfer Learning to Discover Task Hierarchy	Feb 19, 2021	Active LearningHierarchical Reinforcement Learning	CodeCode Available	5
Imitation Learning from Observations under Transition Model Disparity	Apr 25, 2022	Imitation Learningmodel	CodeCode Available	5
Augmented Q Imitation Learning (AQIL)	Mar 31, 2020	Deep Reinforcement LearningImitation Learning	CodeCode Available	5
Imitation Learning from a Single Temporally Misaligned Video	Feb 8, 2025	Imitation Learning	CodeCode Available	5
Imitation Learning from Purified Demonstrations	Oct 11, 2023	Decision MakingImitation Learning	CodeCode Available	5
Imitation Learning for Intra-Day Power Grid Operation through Topology Actions	Jul 29, 2024	Imitation Learning	CodeCode Available	5
A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning	Dec 24, 2023	Imitation Learning	CodeCode Available	5
Imitation Learning for Neural Morphological String Transduction	Aug 31, 2018	Imitation LearningLemmatization	CodeCode Available	5

Title

Status

Hype

Active Policy Improvement from Multiple Black-box Oracles

CodeCode Available

Improving In-Context Learning with Reasoning Distillation

CodeCode Available

Intrinsically Motivated Open-Ended Multi-Task Learning Using Transfer Learning to Discover Task Hierarchy

CodeCode Available

Imitation Learning from Observations under Transition Model Disparity

CodeCode Available

Augmented Q Imitation Learning (AQIL)