Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 2122 papers

Title	Date	Tasks	Status	Hype
Aligning Time Series on Incomparable Spaces	Jun 22, 2020	Dynamic Time WarpingImitation Learning	CodeCode Available	1
JUICER: Data-Efficient Imitation Learning for Robotic Assembly	Apr 4, 2024	Data AugmentationImitation Learning	CodeCode Available	1
LaND: Learning to Navigate from Disengagements	Oct 9, 2020	Autonomous NavigationImitation Learning	CodeCode Available	1
Disagreement-Regularized Imitation Learning	May 1, 2020	continuous-controlContinuous Control	CodeCode Available	1
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning	Feb 8, 2024	Imitation Learning	CodeCode Available	1
Latent Plans for Task-Agnostic Offline Reinforcement Learning	Sep 19, 2022	Imitation Learningreinforcement-learning	CodeCode Available	1
Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining	Apr 5, 2022	Autonomous DrivingImitation Learning	CodeCode Available	1
Causal Imitative Model for Autonomous Driving	Dec 7, 2021	Autonomous DrivingImitation Learning	CodeCode Available	1
Diffusing States and Matching Scores: A New Framework for Imitation Learning	Oct 17, 2024	continuous-controlContinuous Control	CodeCode Available	1
Learning Large Neighborhood Search for Vehicle Routing in Airport Ground Handling	Feb 27, 2023	Combinatorial OptimizationImitation Learning	CodeCode Available	1
Learning Selective Communication for Multi-Agent Path Finding	Sep 12, 2021	Decision MakingDeep Reinforcement Learning	CodeCode Available	1
Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts	Jul 24, 2022	Deep Reinforcement LearningHumanoid Control	CodeCode Available	1
Learning to Extrapolate: A Transductive Approach	Apr 27, 2023	Imitation Learning	CodeCode Available	1
Behavioral Cloning from Observation	May 4, 2018	Imitation Learning	CodeCode Available	1
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos	Aug 12, 2021	Imitation Learningmotion retargeting	CodeCode Available	1
Leveraging Locality to Boost Sample Efficiency in Robotic Manipulation	Jun 15, 2024	Imitation LearningInductive Bias	CodeCode Available	1
Don't Start from Scratch: Behavioral Refinement via Interpolant-based Policy Diffusion	Feb 25, 2024	Imitation Learning	CodeCode Available	1
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation	Mar 5, 2022	Imitation LearningVision and Language Navigation	CodeCode Available	1
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning	Nov 2, 2010	Imitation LearningStructured Prediction	CodeCode Available	1
DERAIL: Diagnostic Environments for Reward And Imitation Learning	Dec 2, 2020	DiagnosticImitation Learning	CodeCode Available	1
LPAC: Learnable Perception-Action-Communication Loops with Applications to Coverage Control	Jan 10, 2024	Graph Neural NetworkImitation Learning	CodeCode Available	1
LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning	Mar 1, 2023	Continuous ControlImitation Learning	CodeCode Available	1
DiffAIL: Diffusion Adversarial Imitation Learning	Dec 11, 2023	Decision MakingImitation Learning	CodeCode Available	1
Discriminator Soft Actor Critic without Extrinsic Rewards	Jan 19, 2020	Imitation LearningQ-Learning	CodeCode Available	1
End-to-End Egospheric Spatial Memory	Feb 15, 2021	General Reinforcement LearningImitation Learning	CodeCode Available	1
Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage	Jun 6, 2021	continuous-controlContinuous Control	CodeCode Available	1
Modeling 3D Shapes by Reinforcement Learning	Mar 27, 2020	Deep Reinforcement LearningImitation Learning	CodeCode Available	1
MoËT: Mixture of Expert Trees and its Application to Verifiable Reinforcement Learning	Jun 16, 2019	Game of GoImitation Learning	CodeCode Available	1
Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment	Nov 7, 2023	Imitation Learning	CodeCode Available	1
MotionAug: Augmentation with Physical Correction for Human Motion Prediction	Mar 17, 2022	Data AugmentationDiversity	CodeCode Available	1
Deep Imitation Learning for Bimanual Robotic Manipulation	Oct 11, 2020	Graph Neural NetworkImitation Learning	CodeCode Available	1
A deep inverse reinforcement learning approach to route choice modeling with context-dependent rewards	Jun 18, 2022	Computational EfficiencyDemand Forecasting	CodeCode Available	1
NAOMI: Non-Autoregressive Multiresolution Sequence Imputation	Jan 30, 2019	Imitation LearningImputation	CodeCode Available	1
NEAT: Neural Attention Fields for End-to-End Autonomous Driving	Sep 9, 2021	Autonomous DrivingCARLA longest6	CodeCode Available	1
An Empirical Investigation of Representation Learning for Imitation	May 16, 2022	image-classificationImage Classification	CodeCode Available	1
Normalizing Flows are Capable Models for RL	May 29, 2025	Imitation LearningReinforcement Learning (RL)	CodeCode Available	1
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update	Feb 1, 2024	Imitation LearningOffline RL	CodeCode Available	1
Off-Policy Adversarial Inverse Reinforcement Learning	May 3, 2020	continuous-controlContinuous Control	CodeCode Available	1
DART: Noise Injection for Robust Imitation Learning	Mar 27, 2017	Imitation LearningMuJoCo	CodeCode Available	1
DeeCap: Dynamic Early Exiting for Efficient Image Captioning	Jan 1, 2022	Image CaptioningImitation Learning	CodeCode Available	1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving	Oct 29, 2022	Autonomous DrivingCARLA MAP Leaderboard	CodeCode Available	1
Optimal Transport for Offline Imitation Learning	Mar 24, 2023	D4RLDecision Making	CodeCode Available	1
CDT: Cascading Decision Trees for Explainable Reinforcement Learning	Nov 15, 2020	Deep Reinforcement LearningExplainable Models	CodeCode Available	1
PADL: Language-Directed Physics-Based Character Control	Jan 31, 2023	Image GenerationImitation Learning	CodeCode Available	1
An Imitation Game for Learning Semantic Parsers from User Interaction	May 2, 2020	Imitation LearningText to SQL	CodeCode Available	1
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning	Jul 4, 2023	DecoderImitation Learning	CodeCode Available	1
Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning	Jun 5, 2025	Imitation Learning	CodeCode Available	1
Curricular Subgoals for Inverse Reinforcement Learning	Jun 14, 2023	Autonomous DrivingD4RL	CodeCode Available	1
General Characterization of Agents by States they Visit	Dec 2, 2020	Decision MakingImitation Learning	CodeCode Available	1
A Coupled Flow Approach to Imitation Learning	Apr 29, 2023	Density EstimationImitation Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 5 of 43Next →

No leaderboard results yet.