SOTAVerified

Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Title	Date	Tasks	Status	Score
Contractive Dynamical Imitation Policies for Efficient Out-of-Sample Recovery	Dec 10, 2024	Imitation Learning	CodeCode Available	5
Imitation learning with artificial neural networks for demand response with a heuristic control approach for heat pumps	Jul 16, 2024	Imitation Learning	CodeCode Available	5
Imitation Learning of Agenda-based Semantic Parsers	Jan 1, 2015	Imitation LearningQuestion Answering	CodeCode Available	5
Imitation Learning of Stabilizing Policies for Nonlinear Systems	Sep 22, 2021	Imitation Learning	CodeCode Available	5
Imitation Learning from Purified Demonstrations	Oct 11, 2023	Decision MakingImitation Learning	CodeCode Available	5
Out-of-Dynamics Imitation Learning from Multimodal Demonstrations	Nov 13, 2022	Imitation LearningMuJoCo	CodeCode Available	5
Active Policy Improvement from Multiple Black-box Oracles	Jun 17, 2023	Imitation LearningReinforcement Learning (RL)	CodeCode Available	5
Imitation Learning from Suboptimal Demonstrations via Meta-Learning An Action Ranker	Dec 28, 2024	Imitation LearningMeta-Learning	CodeCode Available	5
Pay Attention! - Robustifying a Deep Visuomotor Policy Through Task-Focused Visual Attention	Jun 1, 2019	Imitation LearningObject	CodeCode Available	5
Imitrob: Imitation Learning Dataset for Training and Evaluating 6D Object Pose Estimators	Sep 16, 2022	6D Pose Estimation6D Pose Estimation using RGB	CodeCode Available	5

Title

Status

Hype

Contractive Dynamical Imitation Policies for Efficient Out-of-Sample Recovery

CodeCode Available

Imitation learning with artificial neural networks for demand response with a heuristic control approach for heat pumps

CodeCode Available

Imitation Learning of Agenda-based Semantic Parsers

CodeCode Available

Imitation Learning of Stabilizing Policies for Nonlinear Systems

CodeCode Available

Imitation Learning from Purified Demonstrations