Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 2122 papers

Title	Date	Tasks	Status	Hype	Score
Aligning Time Series on Incomparable Spaces	Jun 22, 2020	Dynamic Time WarpingImitation Learning	CodeCode Available	1	5
LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning	Mar 1, 2023	Continuous ControlImitation Learning	CodeCode Available	1	5
Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining	Apr 5, 2022	Autonomous DrivingImitation Learning	CodeCode Available	1	5
Imitation Learning by Estimating Expertise of Demonstrators	Feb 2, 2022	continuous-controlContinuous Control	CodeCode Available	1	5
Imitation Learning via Off-Policy Distribution Matching	Dec 10, 2019	Imitation LearningReinforcement Learning	CodeCode Available	1	5
End-to-End Egospheric Spatial Memory	Feb 15, 2021	General Reinforcement LearningImitation Learning	CodeCode Available	1	5
Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft	Dec 7, 2021	Imitation LearningMinecraft	CodeCode Available	1	5
f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning	Oct 2, 2020	Imitation Learning	CodeCode Available	1	5
Energy-Based Imitation Learning	Apr 20, 2020	Imitation Learningreinforcement-learning	CodeCode Available	1	5
End-to-End Urban Driving by Imitating a Reinforcement Learning Coach	Aug 18, 2021	Autonomous DrivingImitation Learning	CodeCode Available	1	5
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients	Feb 21, 2020	Imitation LearningTransfer Learning	CodeCode Available	1	5
Imitating Latent Policies from Observation	May 21, 2018	Imitation Learning	CodeCode Available	1	5
Imitating Unknown Policies via Exploration	Aug 13, 2020	Behavioural cloningImitation Learning	CodeCode Available	1	5
Behavioral Cloning from Observation	May 4, 2018	Imitation Learning	CodeCode Available	1	5
EvIL: Evolution Strategies for Generalisable Imitation Learning	Jun 15, 2024	Behavioural cloningcontinuous-control	CodeCode Available	1	5
Everyone Deserves A Reward: Learning Customized Human Preferences	Sep 6, 2023	DiversityImitation Learning	CodeCode Available	1	5
Don't Start from Scratch: Behavioral Refinement via Interpolant-based Policy Diffusion	Feb 25, 2024	Imitation Learning	CodeCode Available	1	5
Critic Guided Segmentation of Rewarding Objects in First-Person Views	Jul 20, 2021	Imitation Learning	CodeCode Available	1	5
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning	Nov 2, 2010	Imitation LearningStructured Prediction	CodeCode Available	1	5
Explorative Imitation Learning: A Path Signature Approach for Continuous Environments	Jul 5, 2024	Behavioural cloningImitation Learning	CodeCode Available	1	5
Multi-Agent Interactions Modeling with Correlated Policies	Jan 4, 2020	Imitation Learning	CodeCode Available	1	5
IGibson 1.0: a Simulation Environment for Interactive Tasks in Large Realistic Scenes	Dec 5, 2020	Imitation Learning	CodeCode Available	1	5
Imitation Learning with Sinkhorn Distances	Aug 20, 2020	Imitation LearningMuJoCo	CodeCode Available	1	5
JUICER: Data-Efficient Imitation Learning for Robotic Assembly	Apr 4, 2024	Data AugmentationImitation Learning	CodeCode Available	1	5
An Adversarial Imitation Click Model for Information Retrieval	Apr 13, 2021	Imitation LearningInformation Retrieval	CodeCode Available	1	5
f-IRL: Inverse Reinforcement Learning via State Marginal Matching	Nov 9, 2020	Imitation Learningreinforcement-learning	CodeCode Available	1	5
Frame Mining: a Free Lunch for Learning Robotic Manipulation from 3D Point Clouds	Oct 14, 2022	3D Point Cloud Reinforcement LearningImitation Learning	CodeCode Available	1	5
Normalizing Flows are Capable Models for RL	May 29, 2025	Imitation LearningReinforcement Learning (RL)	CodeCode Available	1	5
Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment	Nov 7, 2023	Imitation Learning	CodeCode Available	1	5
CLIPort: What and Where Pathways for Robotic Manipulation	Sep 24, 2021	Imitation LearningRobotic Grasping	CodeCode Available	1	5
Hybrid Inverse Reinforcement Learning	Feb 13, 2024	continuous-controlContinuous Control	CodeCode Available	1	5
Chain-of-Thought Predictive Control	Apr 3, 2023	Imitation Learning	CodeCode Available	1	5
Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap	Mar 4, 2021	Imitation Learning	CodeCode Available	1	5
OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning	May 24, 2024	continuous-controlContinuous Control	CodeCode Available	1	5
CDT: Cascading Decision Trees for Explainable Reinforcement Learning	Nov 15, 2020	Deep Reinforcement LearningExplainable Models	CodeCode Available	1	5
Human-compatible driving partners through data-regularized self-play reinforcement learning	Mar 28, 2024	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1	5
Generalized Decision Transformer for Offline Hindsight Information Matching	Nov 19, 2021	continuous-controlContinuous Control	CodeCode Available	1	5
Optimal Power Flow Using Graph Neural Networks	Oct 21, 2019	Imitation Learningvalid	CodeCode Available	1	5
Bootstrapped Model Predictive Control	Mar 24, 2025	continuous-controlContinuous Control	CodeCode Available	1	5
Globally Stable Neural Imitation Policies	Mar 7, 2024	Imitation Learning	CodeCode Available	1	5
Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds	Oct 2, 2020	Imitation LearningMotion Planning	CodeCode Available	1	5
Global Tensor Motion Planning	Nov 28, 2024	Dataset GenerationDiversity	CodeCode Available	1	5
iCurb: Imitation Learning-based Detection of Road Curbs using Aerial Images for Autonomous Driving	Mar 31, 2021	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1	5
Goal-Conditioned Imitation Learning using Score-based Diffusion Policies	Apr 5, 2023	DenoisingImitation Learning	CodeCode Available	1	5
Causal Imitation Learning under Temporally Correlated Noise	Feb 2, 2022	EconometricsImitation Learning	CodeCode Available	1	5
Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning	Jun 5, 2025	Imitation Learning	CodeCode Available	1	5
Causal Imitative Model for Autonomous Driving	Dec 7, 2021	Autonomous DrivingImitation Learning	CodeCode Available	1	5
Guiding Deep Molecular Optimization with Genetic Exploration	Jul 4, 2020	Imitation Learning	CodeCode Available	1	5
Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation	Jul 10, 2024	Imitation Learning	CodeCode Available	1	5
A Coupled Flow Approach to Imitation Learning	Apr 29, 2023	Density EstimationImitation Learning	CodeCode Available	1	5

Show:10 25 50

← PrevPage 5 of 43Next →

No leaderboard results yet.