Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 2122 papers

Title	Date	Tasks	Status	Hype
When should we prefer Decision Transformers for Offline Reinforcement Learning?	May 23, 2023	D4RLImitation Learning	CodeCode Available	1
End-to-End Urban Driving by Imitating a Reinforcement Learning Coach	Aug 18, 2021	Autonomous DrivingImitation Learning	CodeCode Available	1
FILM: Following Instructions in Language with Modular Methods	Oct 12, 2021	Imitation LearningInstruction Following	CodeCode Available	1
iCurb: Imitation Learning-based Detection of Road Curbs using Aerial Images for Autonomous Driving	Mar 31, 2021	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
Discriminator Soft Actor Critic without Extrinsic Rewards	Jan 19, 2020	Imitation LearningQ-Learning	CodeCode Available	1
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations	Jul 20, 2022	Imitation LearningOffline RL	CodeCode Available	1
Disagreement-Regularized Imitation Learning	May 1, 2020	continuous-controlContinuous Control	CodeCode Available	1
Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation	Nov 11, 2021	Imitation LearningMotion Planning	CodeCode Available	1
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning	Feb 8, 2024	Imitation Learning	CodeCode Available	1
DiffAIL: Diffusion Adversarial Imitation Learning	Dec 11, 2023	Decision MakingImitation Learning	CodeCode Available	1
Diffusing States and Matching Scores: A New Framework for Imitation Learning	Oct 17, 2024	continuous-controlContinuous Control	CodeCode Available	1
EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints	Nov 13, 2020	Imitation LearningMachine Translation	CodeCode Available	1
A Bayesian Approach to Robust Inverse Reinforcement Learning	Sep 15, 2023	Imitation LearningMuJoCo	CodeCode Available	1
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects	Oct 3, 2024	BenchmarkingImitation Learning	CodeCode Available	1
Adversarial Option-Aware Hierarchical Imitation Learning	Jun 10, 2021	Imitation Learning	CodeCode Available	1
Emergent Communication at Scale	Sep 29, 2021	Imitation Learning	CodeCode Available	1
DeformPAM: Data-Efficient Learning for Long-horizon Deformable Object Manipulation via Preference-based Action Alignment	Oct 15, 2024	Deformable Object ManipulationImitation Learning	CodeCode Available	1
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation	Nov 25, 2022	continuous-controlContinuous Control	CodeCode Available	1
Active Imitation Learning with Noisy Guidance	May 26, 2020	Active LearningImitation Learning	CodeCode Available	1
Atari-HEAD: Atari Human Eye-Tracking and Demonstration Dataset	Mar 15, 2019	Decision MakingImitation Learning	CodeCode Available	1
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients	Feb 21, 2020	Imitation LearningTransfer Learning	CodeCode Available	1
Everyone Deserves A Reward: Learning Customized Human Preferences	Sep 6, 2023	DiversityImitation Learning	CodeCode Available	1
Augmented Behavioral Cloning from Observation	Apr 28, 2020	Behavioural cloningImitation Learning	CodeCode Available	1
Exact Combinatorial Optimization with Graph Convolutional Neural Networks	Jun 4, 2019	Combinatorial OptimizationImitation Learning	CodeCode Available	1
DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling	Dec 6, 2024	Dialogue GenerationImitation Learning	CodeCode Available	1
A deep inverse reinforcement learning approach to route choice modeling with context-dependent rewards	Jun 18, 2022	Computational EfficiencyDemand Forecasting	CodeCode Available	1
Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining	Apr 5, 2022	Autonomous DrivingImitation Learning	CodeCode Available	1
Following High-level Navigation Instructions on a Simulated Quadcopter with Imitation Learning	May 31, 2018	Imitation LearningInstruction Following	CodeCode Available	1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving	Oct 29, 2022	Autonomous DrivingCARLA MAP Leaderboard	CodeCode Available	1
GAIL-PT: A Generic Intelligent Penetration Testing Framework with Generative Adversarial Imitation Learning	Apr 5, 2022	Imitation LearningQ-Learning	CodeCode Available	1
Generalization Guarantees for Imitation Learning	Aug 5, 2020	Generalization BoundsImitation Learning	CodeCode Available	1
Generalized Decision Transformer for Offline Hindsight Information Matching	Nov 19, 2021	continuous-controlContinuous Control	CodeCode Available	1
DERAIL: Diagnostic Environments for Reward And Imitation Learning	Dec 2, 2020	DiagnosticImitation Learning	CodeCode Available	1
Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds	Oct 2, 2020	Imitation LearningMotion Planning	CodeCode Available	1
Go-Explore: a New Approach for Hard-Exploration Problems	Jan 30, 2019	Atari GamesImitation Learning	CodeCode Available	1
Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation	Jul 10, 2024	Imitation Learning	CodeCode Available	1
HAD-Gen: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario Generation	Mar 19, 2025	Autonomous VehiclesImitation Learning	CodeCode Available	1
Option-Aware Adversarial Inverse Reinforcement Learning for Robotic Control	Oct 5, 2022	Imitation LearningMulti-Task Learning	CodeCode Available	1
HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding	Feb 23, 2024	Imitation LearningReinforcement Learning (RL)	CodeCode Available	1
Autonomous Racing using a Hybrid Imitation-Reinforcement Learning Architecture	Oct 11, 2021	Autonomous RacingAutonomous Vehicles	CodeCode Available	1
DeeCap: Dynamic Early Exiting for Efficient Image Captioning	Jan 1, 2022	Image CaptioningImitation Learning	CodeCode Available	1
DART: Noise Injection for Robust Imitation Learning	Mar 27, 2017	Imitation LearningMuJoCo	CodeCode Available	1
AI2-THOR: An Interactive 3D Environment for Visual AI	Dec 14, 2017	Deep Reinforcement LearningImitation Learning	CodeCode Available	1
A Visual Navigation Perspective for Category-Level Object Pose Estimation	Mar 25, 2022	Imitation LearningPose Estimation	CodeCode Available	1
Curriculum Offline Imitation Learning	Nov 3, 2021	continuous-controlContinuous Control	CodeCode Available	1
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning	Feb 16, 2023	Imitation LearningOffline RL	CodeCode Available	1
BabyAI 1.1	Jul 24, 2020	Computational EfficiencyImitation Learning	CodeCode Available	1
Zero-Shot Compositional Policy Learning via Language Grounding	Apr 15, 2020	DescriptiveDomain Adaptation	CodeCode Available	1
BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps	May 10, 2020	Imitation LearningNavigate	CodeCode Available	1
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning	Nov 2, 2010	Imitation LearningStructured Prediction	CodeCode Available	1

Show:10 25 50

← PrevPage 4 of 43Next →

No leaderboard results yet.