SOTAVerified

Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Title	Date	Tasks	Status
Semi-Supervised One-Shot Imitation Learning	Aug 9, 2024	Few-Shot LearningImitation Learning	—Unverified
SENSOR: Imitate Third-Person Expert's Behaviors via Active Sensoring	Apr 4, 2024	Imitation Learning	—Unverified
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking	Jun 8, 2023	Imitation LearningText Generation	—Unverified
Sequential Causal Imitation Learning with Unobserved Confounders	Aug 12, 2022	Decision MakingImitation Learning	—Unverified
Shaping Rewards for Reinforcement Learning with Imperfect Demonstrations using Generative Models	Nov 2, 2020	Imitation Learningreinforcement-learning	—Unverified
Shared Multi-Task Imitation Learning for Indoor Self-Navigation	Aug 14, 2018	Imitation Learning	—Unverified
SHEF-MIME: Word-level Quality Estimation Using Imitation Learning	Aug 1, 2016	Feature EngineeringImitation Learning	—Unverified
SIMILE: Introducing Sequential Information towards More Effective Imitation Learning	May 1, 2019	Imitation LearningOpenAI Gym	—Unverified
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey	Sep 24, 2020	Deep Reinforcement LearningDomain Adaptation	—Unverified
Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup	Feb 13, 2019	Imitation LearningReinforcement Learning	—Unverified

Title

Status

Hype

Semi-Supervised One-Shot Imitation Learning

—Unverified

SENSOR: Imitate Third-Person Expert's Behaviors via Active Sensoring

—Unverified

SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

—Unverified

Sequential Causal Imitation Learning with Unobserved Confounders

—Unverified

Shaping Rewards for Reinforcement Learning with Imperfect Demonstrations using Generative Models