SOTAVerified

Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Title	Date	Tasks	Status	Hype
Visual-based Autonomous Driving Deployment from a Stochastic and Uncertainty-aware Perspective	Mar 3, 2019	Autonomous DrivingDomain Adaptation	CodeCode Available	0
GRP Model for Sensorimotor Learning	Mar 1, 2019	Imitation Learningmodel	—Unverified	0
Learning Dynamic-Objective Policies from a Class of Optimal Trajectories	Feb 27, 2019	Imitation Learning	—Unverified	0
Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning	Feb 15, 2019	Deep Reinforcement LearningImitation Learning	CodeCode Available	0
Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup	Feb 13, 2019	Imitation LearningReinforcement Learning	—Unverified	0
Artificial Intelligence for Prosthetics - challenge solutions	Feb 7, 2019	Deep Reinforcement LearningImitation Learning	CodeCode Available	0
Decentralized Multi-Agents by Imitation of a Centralized Controller	Feb 6, 2019	Imitation LearningMulti-agent Reinforcement Learning	—Unverified	0
Non-Monotonic Sequential Text Generation	Feb 5, 2019	Imitation LearningPosition	CodeCode Available	0
NAOMI: Non-Autoregressive Multiresolution Sequence Imputation	Jan 30, 2019	Imitation LearningImputation	CodeCode Available	1
Go-Explore: a New Approach for Hard-Exploration Problems	Jan 30, 2019	Atari GamesImitation Learning	CodeCode Available	1

Title

Status

Hype

Visual-based Autonomous Driving Deployment from a Stochastic and Uncertainty-aware Perspective

CodeCode Available

GRP Model for Sensorimotor Learning

—Unverified

Learning Dynamic-Objective Policies from a Class of Optimal Trajectories

—Unverified

Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning

CodeCode Available

Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup