Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Demonstrations are usually given as state-action trajectories, where each pair indicates the action to take in the visited state. The demonstrated actions are typically used in one of two ways. The first, known as Behavior Cloning (BC), treats each action as the target label for its state and learns a generalized mapping from states to actions in a supervised manner. The second, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions and aims to find a reward/cost function under which those decisions are optimal.
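To make the Behavior Cloning idea concrete, here is a minimal sketch that treats demonstrations as a labeled dataset and fits a policy by supervised learning. Everything in it is an illustrative assumption rather than something taken from a particular paper: the one-dimensional state, the two actions, the toy `demos` list, and the helper names `fit_bc_policy` and `act` are all hypothetical.

```python
import math

def fit_bc_policy(demos, lr=0.5, epochs=200):
    """Fit pi(a=1 | s) = sigmoid(w*s + b) to (state, action) pairs
    by gradient descent on the average cross-entropy loss."""
    w, b = 0.0, 0.0
    n = len(demos)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for s, a in demos:
            p = 1.0 / (1.0 + math.exp(-(w * s + b)))  # predicted P(a=1 | s)
            grad_w += (p - a) * s                     # d(loss)/dw for this pair
            grad_b += (p - a)                         # d(loss)/db for this pair
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b

def act(params, s):
    """Greedy action of the cloned policy in state s."""
    w, b = params
    return 1 if w * s + b > 0 else 0

# Hypothetical expert demonstrations: action 1 for positive states, 0 otherwise.
demos = [(-2.0, 0), (-1.0, 0), (-0.5, 0), (0.5, 1), (1.0, 1), (2.0, 1)]
params = fit_bc_policy(demos)

# The cloned policy can be queried on states that never appeared in the demos.
print(act(params, 1.5), act(params, -1.5))
```

A logistic-regression policy is used here only because it fits in a few lines; in practice the same supervised recipe is applied with whatever function approximator suits the state and action spaces.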

Finally, a newer methodology, Inverse Q-Learning, aims to learn Q-functions directly from expert data. The learned Q-function implicitly represents the reward, and the optimal policy under it is a Boltzmann distribution, as in soft Q-learning.
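The relationship between a Q-function, its Boltzmann policy, and the implied reward can be sketched with the standard soft Q-learning identities: V(s) = α log Σ_a exp(Q(s,a)/α), π(a|s) = exp((Q(s,a) − V(s))/α), and r(s,a) = Q(s,a) − γ V(s'). The sketch below assumes a discrete action set and a deterministic transition to a single next state; the Q-values, the temperature `alpha`, the discount `gamma`, and the function names are hypothetical and chosen only for illustration.

```python
import math

def soft_value(q_values, alpha=1.0):
    """Soft state value: alpha * log(sum_a exp(Q(s, a) / alpha)),
    computed with the max subtracted for numerical stability."""
    m = max(q_values)
    return m + alpha * math.log(sum(math.exp((q - m) / alpha) for q in q_values))

def boltzmann_policy(q_values, alpha=1.0):
    """Action probabilities pi(a | s) = exp((Q(s, a) - V(s)) / alpha)."""
    v = soft_value(q_values, alpha)
    return [math.exp((q - v) / alpha) for q in q_values]

def implied_reward(q_sa, next_q_values, gamma=0.99, alpha=1.0):
    """Reward implied by Q under the soft Bellman equation,
    assuming a deterministic transition: r = Q(s, a) - gamma * V(s')."""
    return q_sa - gamma * soft_value(next_q_values, alpha)

# Hypothetical Q-values for three actions in the current and next state.
q_s = [1.0, 2.0, 0.5]
q_next = [0.0, 0.3, -0.2]

probs = boltzmann_policy(q_s)
reward = implied_reward(q_s[1], q_next)
print(probs, reward)
```

Lowering `alpha` concentrates the policy on the highest-valued action, while raising it moves the policy toward uniform.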

Source: Learning to Imitate

Title	Date	Tasks	Status	Hype
Imitation Learning for Sentence Generation with Dilated Convolutions Using Adversarial Training	Aug 15, 2019	Diversity, Generative Adversarial Network	Code Available	0
Learning Vision-based Flight in Drone Swarms by Imitation	Aug 8, 2019	Collision Avoidance, Domain Adaptation	Unverified	0
Batch Recurrent Q-Learning for Backchannel Generation Towards Engaging Agents	Aug 6, 2019	Imitation Learning, Q-Learning	Unverified	0
Comyco: Quality-Aware Adaptive Video Streaming via Imitation Learning	Aug 6, 2019	Imitation Learning	Code Available	0
Learning to combine primitive skills: A step towards versatile robotic manipulation	Aug 2, 2019	Data Augmentation, Imitation Learning	Code Available	1
Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation	Aug 1, 2019	Decision Making, Imitation Learning	Unverified	0
Self-Imitation Learning of Locomotion Movements through Termination Curriculum	Jul 27, 2019	Imitation Learning, Reinforcement Learning	Code Available	0
Deep Reinforcement Learning for Personalized Search Story Recommendation	Jul 26, 2019	Deep Reinforcement Learning, Image Retrieval	Unverified	0
Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards	Jul 24, 2019	Atari Games, Diversity	Unverified	0
Learning Goal-Oriented Visual Dialog Agents: Imitating and Surpassing Analytic Experts	Jul 24, 2019	Imitation Learning, reinforcement-learning	Unverified	0
