Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 326–350 of 2122 papers

Title	Date	Tasks	Status	Hype	Score
PateGail: A Privacy-Preserving Mobility Trajectory Generator with Imitation Learning	Jul 23, 2024	Decision MakingImitation Learning	CodeCode Available	1	5
DeeCap: Dynamic Early Exiting for Efficient Image Captioning	Jan 1, 2022	Image CaptioningImitation Learning	CodeCode Available	1	5
Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts	Jul 24, 2022	Deep Reinforcement LearningHumanoid Control	CodeCode Available	1	5
f-IRL: Inverse Reinforcement Learning via State Marginal Matching	Nov 9, 2020	Imitation Learningreinforcement-learning	CodeCode Available	1	5
General Characterization of Agents by States they Visit	Dec 2, 2020	Decision MakingImitation Learning	CodeCode Available	1	5
LILA: Language-Informed Latent Actions	Nov 5, 2021	Imitation Learning	CodeCode Available	1	5
Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage	May 21, 2021	continuous-controlContinuous Control	CodeCode Available	1	5
Frame Mining: a Free Lunch for Learning Robotic Manipulation from 3D Point Clouds	Oct 14, 2022	3D Point Cloud Reinforcement LearningImitation Learning	CodeCode Available	1	5
Preference-grounded Token-level Guidance for Language Model Fine-tuning	Jun 1, 2023	Imitation LearningLanguage Modeling	CodeCode Available	1	5
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization	Jun 23, 2020	Imitation Learningreinforcement-learning	CodeCode Available	1	5
Optimal Transport for Offline Imitation Learning	Mar 24, 2023	D4RLDecision Making	CodeCode Available	1	5
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation	Nov 25, 2022	continuous-controlContinuous Control	CodeCode Available	1	5
Proof Artifact Co-training for Theorem Proving with Language Models	Feb 11, 2021	Automated Theorem ProvingImitation Learning	CodeCode Available	1	5
SEABO: A Simple Search-Based Method for Offline Imitation Learning	Feb 6, 2024	D4RLImitation Learning	CodeCode Available	1	5
Generalization Guarantees for Imitation Learning	Aug 5, 2020	Generalization BoundsImitation Learning	CodeCode Available	1	5
Atari-HEAD: Atari Human Eye-Tracking and Demonstration Dataset	Mar 15, 2019	Decision MakingImitation Learning	CodeCode Available	1	5
Capability-Aware Shared Hypernetworks for Flexible Heterogeneous Multi-Robot Coordination	Jan 10, 2025	DiversityImitation Learning	CodeCode Available	0	5
Learning Visuomotor Policies for Aerial Navigation Using Cross-Modal Representations	Sep 16, 2019	Drone navigationImitation Learning	CodeCode Available	0	5
Learning for Long-Horizon Planning via Neuro-Symbolic Abductive Imitation	Nov 27, 2024	Imitation LearningLogical Reasoning	CodeCode Available	0	5
Learning Belief Representations for Imitation Learning in POMDPs	Jun 22, 2019	continuous-controlContinuous Control	CodeCode Available	0	5
An Imitation Learning Approach to Unsupervised Parsing	Jun 5, 2019	Imitation LearningLanguage Modeling	CodeCode Available	0	5
Learning Calibratable Policies using Programmatic Style-Consistency	Oct 2, 2019	Imitation LearningMuJoCo	CodeCode Available	0	5
Learning Robot Manipulation from Cross-Morphology Demonstration	Apr 7, 2023	Imitation LearningRobot Manipulation	CodeCode Available	0	5
Addressing reward bias in Adversarial Imitation Learning with neutral reward functions	Sep 20, 2020	Imitation Learning	CodeCode Available	0	5
Brain-Inspired Deep Imitation Learning for Autonomous Driving Systems	Jul 30, 2021	Autonomous DrivingImitation Learning	CodeCode Available	0	5

Show:10 25 50

← PrevPage 14 of 85Next →

No leaderboard results yet.