Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 2122 papers

Title	Date	Tasks	Status	Hype	Score
Learning Structural Edits via Incremental Tree Transformations	Jan 28, 2021	Imitation Learning	CodeCode Available	1	5
Learning to Simulate Daily Activities via Modeling Dynamic Human Needs	Feb 9, 2023	Imitation LearningSand	CodeCode Available	1	5
Leveraging Locality to Boost Sample Efficiency in Robotic Manipulation	Jun 15, 2024	Imitation LearningInductive Bias	CodeCode Available	1	5
CRIL: Continual Robot Imitation Learning via Generative and Prediction Model	Jun 17, 2021	Generative Adversarial NetworkImitation Learning	CodeCode Available	1	5
Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld	Nov 28, 2023	Imitation Learning	CodeCode Available	1	5
Counter-Strike Deathmatch with Large-Scale Behavioural Cloning	Apr 9, 2021	AI AgentBehavioural cloning	CodeCode Available	1	5
Critic Guided Segmentation of Rewarding Objects in First-Person Views	Jul 20, 2021	Imitation Learning	CodeCode Available	1	5
NAOMI: Non-Autoregressive Multiresolution Sequence Imputation	Jan 30, 2019	Imitation LearningImputation	CodeCode Available	1	5
End-to-End Egospheric Spatial Memory	Feb 15, 2021	General Reinforcement LearningImitation Learning	CodeCode Available	1	5
End-to-End Imitation Learning with Safety Guarantees using Control Barrier Functions	Dec 21, 2022	Imitation Learning	CodeCode Available	1	5
CAFE-AD: Cross-Scenario Adaptive Feature Enhancement for Trajectory Planning in Autonomous Driving	Apr 9, 2025	Autonomous DrivingFeature Importance	CodeCode Available	1	5
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning	Dec 12, 2022	Data AugmentationImage Generation	CodeCode Available	1	5
Learning Object Relation Graph and Tentative Policy for Visual Navigation	Jul 21, 2020	Imitation LearningRelation	CodeCode Available	1	5
Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning	Oct 27, 2021	Decision MakingImitation Learning	CodeCode Available	1	5
Off-Policy Imitation Learning from Observations	Feb 25, 2021	Imitation Learning	CodeCode Available	1	5
Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap	Mar 4, 2021	Imitation Learning	CodeCode Available	1	5
Coherent Soft Imitation Learning	May 25, 2023	Imitation Learningreinforcement-learning	CodeCode Available	1	5
Learning Selective Communication for Multi-Agent Path Finding	Sep 12, 2021	Decision MakingDeep Reinforcement Learning	CodeCode Available	1	5
Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid Locomotion	Jul 16, 2024	Efficient ExplorationImitation Learning	CodeCode Available	1	5
Learning Large Neighborhood Search for Vehicle Routing in Airport Ground Handling	Feb 27, 2023	Combinatorial OptimizationImitation Learning	CodeCode Available	1	5
Cross-Domain Imitation Learning via Optimal Transport	Oct 7, 2021	continuous-controlContinuous Control	CodeCode Available	1	5
Orca: Progressive Learning from Complex Explanation Traces of GPT-4	Jun 5, 2023	Imitation LearningKnowledge Distillation	CodeCode Available	1	5
f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning	Oct 2, 2020	Imitation Learning	CodeCode Available	1	5
Learning to combine primitive skills: A step towards versatile robotic manipulation	Aug 2, 2019	Data AugmentationImitation Learning	CodeCode Available	1	5
Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft	Dec 7, 2021	Imitation LearningMinecraft	CodeCode Available	1	5
PateGail: A Privacy-Preserving Mobility Trajectory Generator with Imitation Learning	Jul 23, 2024	Decision MakingImitation Learning	CodeCode Available	1	5
DeeCap: Dynamic Early Exiting for Efficient Image Captioning	Jan 1, 2022	Image CaptioningImitation Learning	CodeCode Available	1	5
Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts	Jul 24, 2022	Deep Reinforcement LearningHumanoid Control	CodeCode Available	1	5
f-IRL: Inverse Reinforcement Learning via State Marginal Matching	Nov 9, 2020	Imitation Learningreinforcement-learning	CodeCode Available	1	5
General Characterization of Agents by States they Visit	Dec 2, 2020	Decision MakingImitation Learning	CodeCode Available	1	5
LILA: Language-Informed Latent Actions	Nov 5, 2021	Imitation Learning	CodeCode Available	1	5
Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage	May 21, 2021	continuous-controlContinuous Control	CodeCode Available	1	5
Frame Mining: a Free Lunch for Learning Robotic Manipulation from 3D Point Clouds	Oct 14, 2022	3D Point Cloud Reinforcement LearningImitation Learning	CodeCode Available	1	5
Preference-grounded Token-level Guidance for Language Model Fine-tuning	Jun 1, 2023	Imitation LearningLanguage Modeling	CodeCode Available	1	5
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization	Jun 23, 2020	Imitation Learningreinforcement-learning	CodeCode Available	1	5
Optimal Transport for Offline Imitation Learning	Mar 24, 2023	D4RLDecision Making	CodeCode Available	1	5
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation	Nov 25, 2022	continuous-controlContinuous Control	CodeCode Available	1	5
Proof Artifact Co-training for Theorem Proving with Language Models	Feb 11, 2021	Automated Theorem ProvingImitation Learning	CodeCode Available	1	5
SEABO: A Simple Search-Based Method for Offline Imitation Learning	Feb 6, 2024	D4RLImitation Learning	CodeCode Available	1	5
Generalization Guarantees for Imitation Learning	Aug 5, 2020	Generalization BoundsImitation Learning	CodeCode Available	1	5
Atari-HEAD: Atari Human Eye-Tracking and Demonstration Dataset	Mar 15, 2019	Decision MakingImitation Learning	CodeCode Available	1	5
Capability-Aware Shared Hypernetworks for Flexible Heterogeneous Multi-Robot Coordination	Jan 10, 2025	DiversityImitation Learning	CodeCode Available	0	5
Learning Visuomotor Policies for Aerial Navigation Using Cross-Modal Representations	Sep 16, 2019	Drone navigationImitation Learning	CodeCode Available	0	5
Learning for Long-Horizon Planning via Neuro-Symbolic Abductive Imitation	Nov 27, 2024	Imitation LearningLogical Reasoning	CodeCode Available	0	5
Learning Belief Representations for Imitation Learning in POMDPs	Jun 22, 2019	continuous-controlContinuous Control	CodeCode Available	0	5
An Imitation Learning Approach to Unsupervised Parsing	Jun 5, 2019	Imitation LearningLanguage Modeling	CodeCode Available	0	5
Learning Calibratable Policies using Programmatic Style-Consistency	Oct 2, 2019	Imitation LearningMuJoCo	CodeCode Available	0	5
Learning Robot Manipulation from Cross-Morphology Demonstration	Apr 7, 2023	Imitation LearningRobot Manipulation	CodeCode Available	0	5
Addressing reward bias in Adversarial Imitation Learning with neutral reward functions	Sep 20, 2020	Imitation Learning	CodeCode Available	0	5
Brain-Inspired Deep Imitation Learning for Autonomous Driving Systems	Jul 30, 2021	Autonomous DrivingImitation Learning	CodeCode Available	0	5

Show:10 25 50

← PrevPage 7 of 43Next →

No leaderboard results yet.