Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Demonstrations are usually given as state-action trajectories, in which each pair indicates the action to take in the visited state. The demonstrated actions are typically used in one of two ways. The first, Behavior Cloning (BC), treats each action as the target label for its state and learns a generalized mapping from states to actions in a supervised manner. The second, Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions and aims to find a reward/cost function under which those decisions are optimal.
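The supervised view taken by Behavior Cloning can be made concrete with a small sketch. Everything in it is an assumption for illustration: the toy expert (act 1 when the first state coordinate is positive, else 0), the linear softmax policy, and the names make_demonstrations, train_bc, and policy are hypothetical and are not taken from any of the papers listed on this page.

```python
import numpy as np


def make_demonstrations(n=200, seed=0):
    """Hypothetical expert data: 2-D states, action 1 if x > 0 else action 0."""
    rng = np.random.default_rng(seed)
    states = rng.uniform(-1.0, 1.0, size=(n, 2))
    actions = (states[:, 0] > 0).astype(int)
    return states, actions


def train_bc(states, actions, n_actions=2, lr=0.5, epochs=500):
    """Fit a linear softmax policy by minimizing cross-entropy on (state, action) pairs."""
    n, d = states.shape
    X = np.hstack([states, np.ones((n, 1))])  # append a constant bias feature
    W = np.zeros((d + 1, n_actions))
    Y = np.eye(n_actions)[actions]  # one-hot action labels
    for _ in range(epochs):
        logits = X @ W
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        probs = np.exp(logits)
        probs /= probs.sum(axis=1, keepdims=True)
        W -= lr * X.T @ (probs - Y) / n  # gradient of the mean cross-entropy
    return W


def policy(W, state):
    """Greedy action of the cloned policy for a single state."""
    x = np.append(np.asarray(state, dtype=float), 1.0)
    return int(np.argmax(x @ W))


states, actions = make_demonstrations()
W = train_bc(states, actions)
accuracy = np.mean([policy(W, s) == a for s, a in zip(states, actions)])
```

The cloned policy is only as good as the states covered by the demonstrations; the sketch does nothing about states the expert never visited.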

Finally, a newer methodology, Inverse Q-Learning, aims to learn Q-functions directly from expert data. The Q-function implicitly represents the reward, and the optimal policy under it is a Boltzmann distribution, as in soft Q-learning.
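As a minimal sketch of the Boltzmann policy mentioned above, the snippet below turns the Q-values of one state into action probabilities proportional to exp(Q(s, a) / temperature), together with the matching soft value (a temperature-scaled log-sum-exp). The function names and the temperature parameter are assumptions for illustration, and the Q-values are taken as given; how they are learned from expert data is not shown.

```python
import numpy as np


def boltzmann_policy(q_values, temperature=1.0):
    """Action probabilities proportional to exp(Q(s, a) / temperature)."""
    q = np.asarray(q_values, dtype=float) / temperature
    q = q - q.max()  # shift for numerical stability; probabilities are unchanged
    p = np.exp(q)
    return p / p.sum()


def soft_value(q_values, temperature=1.0):
    """Soft state value: temperature * log sum_a exp(Q(s, a) / temperature)."""
    q = np.asarray(q_values, dtype=float)
    m = q.max()
    return m + temperature * np.log(np.sum(np.exp((q - m) / temperature)))


q_example = [1.0, 2.0, 0.5]  # hypothetical Q-values for one state
probs = boltzmann_policy(q_example, temperature=0.5)
value = soft_value(q_example, temperature=0.5)
```

With these definitions the policy can equivalently be written as exp((Q(s, a) - V(s)) / temperature). Lower temperatures concentrate probability on the highest-valued action.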

Source: Learning to Imitate

Papers

Showing 1–50 of 2122 papers

Title	Date	Tasks	Status	Hype	Score
Steering Language Models with Game-Theoretic Solvers	Jan 24, 2024	Imitation Learning, Scheduling	Code Available	9	5
OpenVLA: An Open-Source Vision-Language-Action Model	Jun 13, 2024	Imitation Learning, Language Modelling	Code Available	9	5
ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI	Oct 1, 2024	GPU, Imitation Learning	Code Available	7	5
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success	Feb 27, 2025	Action Generation, Chunking	Code Available	5	5
3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations	Mar 6, 2024	Imitation Learning, Robot Manipulation	Code Available	5	5
Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments	Jan 10, 2023	GPU, Imitation Learning	Code Available	5	5
PointVLA: Injecting the 3D World into Vision-Language-Action Models	Mar 10, 2025	Imitation Learning, Spatial Reasoning	Code Available	4	5
ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planning	Aug 4, 2024	Decoder, Imitation Learning	Code Available	4	5
Diffusion-Based Planning for Autonomous Driving with Flexible Guidance	Jan 26, 2025	Autonomous Driving, Imitation Learning	Code Available	4	5
An Imitative Reinforcement Learning Framework for Autonomous Dogfight	Jun 17, 2024	Imitation Learning, reinforcement-learning	Code Available	3	5
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters	May 4, 2022	GPU, Imitation Learning	Code Available	3	5
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark	Jul 10, 2024	Imitation Learning	Code Available	3	5
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving	Sep 29, 2023	Arithmetic Reasoning, Computational Efficiency	Code Available	3	5
LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion	Nov 4, 2023	Benchmarking, Imitation Learning	Code Available	3	5
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos	Jun 23, 2022	Imitation Learning, Minecraft	Code Available	3	5
Behavior Generation with Latent Actions	Mar 5, 2024	Autonomous Driving, Decision Making	Code Available	3	5
TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving	May 31, 2022	Autonomous Driving, CARLA longest6	Code Available	3	5
Is Value Learning Really the Main Bottleneck in Offline RL?	Jun 13, 2024	Imitation Learning, Offline RL	Code Available	3	5
Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation	Mar 4, 2025	Contact-rich Manipulation, Imitation Learning	Code Available	3	5
imitation: Clean Imitation Learning Implementations	Nov 22, 2022	Imitation Learning, reinforcement-learning	Code Available	3	5
A Survey of Embodied Learning for Object-Centric Robotic Manipulation	Aug 21, 2024	Imitation Learning, Object	Code Available	3	5
Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments	Sep 9, 2024	Imitation Learning	Code Available	3	5
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos	Nov 26, 2024	Common Sense Reasoning, Imitation Learning	Code Available	3	5
BridgeData V2: A Dataset for Robot Learning at Scale	Aug 24, 2023	Imitation Learning, Multi-Task Learning	Code Available	2	5
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization	Oct 25, 2024	Imitation Learning	Code Available	2	5
Model-Based Imitation Learning for Urban Driving	Oct 14, 2022	3D geometry, Autonomous Driving	Code Available	2	5
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving	Apr 19, 2021	Autonomous Driving	Code Available	2	5
MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale	Aug 29, 2024	Deep Reinforcement Learning, Imitation Learning	Code Available	2	5
ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks	Dec 9, 2024	GPU, Imitation Learning	Code Available	2	5
MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations	Oct 26, 2023	Imitation Learning	Code Available	2	5
Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world	Jun 20, 2022	Imitation Learning	Code Available	2	5
PlanT: Explainable Planning Transformers via Object-Level Representations	Oct 25, 2022	CARLA longest6, Decision Making	Code Available	2	5
Language-Driven Representation Learning for Robotics	Feb 24, 2023	Contrastive Learning, Imitation Learning	Code Available	2	5
A General Language Assistant as a Laboratory for Alignment	Dec 1, 2021	Imitation Learning	Code Available	2	5
Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving	Sep 24, 2024	Autonomous Driving, Imitation Learning	Code Available	2	5
GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation	Nov 2, 2024	Imitation Learning	Code Available	2	5
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks	Mar 27, 2025	Imitation Learning, Mathematical Reasoning	Code Available	2	5
EgoMimic: Scaling Imitation Learning via Egocentric Video	Oct 31, 2024	Diversity, Imitation Learning	Code Available	2	5
Equivariant Diffusion Policy	Jul 1, 2024	Imitation Learning, Robot Manipulation	Code Available	2	5
Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning	Jun 30, 2025	Imitation Learning, Trajectory Planning	Code Available	2	5
DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation	Oct 19, 2022	Deep Reinforcement Learning, Imitation Learning	Code Available	2	5
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation	May 22, 2023	Imitation Learning, Motion Planning	Code Available	2	5
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions	Feb 21, 2024	Decision Making, Imitation Learning	Code Available	2	5
In-Context Imitation Learning via Next-Token Prediction	Aug 28, 2024	Imitation Learning, Prediction	Code Available	2	5
Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations	Apr 26, 2024	Imitation Learning	Code Available	2	5
LangProp: A code optimization framework using Large Language Models applied to driving	Jan 18, 2024	Autonomous Driving, Code Generation	Code Available	2	5
AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control	Apr 5, 2021	Imitation Learning, Reinforcement Learning (RL)	Code Available	2	5
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies	Feb 6, 2024	Decision Making, Diversity	Code Available	2	5
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling	Jan 20, 2025	Imitation Learning, Language Modeling	Code Available	2	5
DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References	Feb 13, 2025	Human-Object Interaction Detection, Imitation Learning	Code Available	2	5

