Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 2122 papers

Title	Date	Tasks	Status	Hype
Spatially Visual Perception for End-to-End Robotic Learning	Nov 26, 2024	Depth EstimationImage Augmentation	—Unverified	0
Self-reconfiguration Strategies for Space-distributed Spacecraft	Nov 26, 2024	Imitation Learning	—Unverified	0
LHPF: Look back the History and Plan for the Future in Autonomous Driving	Nov 26, 2024	Autonomous DrivingDecision Making	—Unverified	0
RoCoDA: Counterfactual Data Augmentation for Data-Efficient Robot Learning from Demonstrations	Nov 25, 2024	counterfactualData Augmentation	—Unverified	0
End-to-End Steering for Autonomous Vehicles via Conditional Imitation Co-Learning	Nov 25, 2024	Autonomous DrivingAutonomous Vehicles	—Unverified	0
WildLMa: Long Horizon Loco-Manipulation in the Wild	Nov 22, 2024	Imitation Learning	—Unverified	0
Neuromorphic Attitude Estimation and Control	Nov 21, 2024	Imitation Learning	CodeCode Available	1
Instant Policy: In-Context Imitation Learning via Graph Diffusion	Nov 19, 2024	Graph GenerationImitation Learning	—Unverified	0
Error-Feedback Model for Output Correction in Bilateral Control-Based Imitation Learning	Nov 19, 2024	Imitation Learning	—Unverified	0
Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms	Nov 18, 2024	Imitation LearningModel Compression	—Unverified	0
Learning Generalizable 3D Manipulation With 10 Demonstrations	Nov 15, 2024	DenoisingImitation Learning	CodeCode Available	0
Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation	Nov 15, 2024	Domain AdaptationImitation Learning	CodeCode Available	0
Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment	Nov 14, 2024	BIRLImitation Learning	—Unverified	0
Robot See, Robot Do: Imitation Reward for Noisy Financial Environments	Nov 13, 2024	Decision MakingImitation Learning	—Unverified	0
Imitation Learning from Observations: An Autoregressive Mixture of Experts Approach	Nov 12, 2024	Autonomous DrivingImitation Learning	—Unverified	0
Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning	Nov 12, 2024	Imitation LearningOffline RL	—Unverified	0
Learning Memory Mechanisms for Decision Making through Demonstrations	Nov 12, 2024	Decision MakingImitation Learning	CodeCode Available	0
EMPERROR: A Flexible Generative Perception Error Model for Probing Self-Driving Planners	Nov 12, 2024	Imitation Learning	—Unverified	0
Identifying Differential Patient Care Through Inverse Intent Inference	Nov 11, 2024	counterfactualImitation Learning	—Unverified	0
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration	Nov 11, 2024	continuous-controlContinuous Control	—Unverified	0
Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion	Nov 7, 2024	Data AugmentationImitation Learning	CodeCode Available	1
IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving	Nov 7, 2024	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
Scaling Laws for Pre-training Agents and World Models	Nov 7, 2024	Imitation LearningLanguage Modeling	—Unverified	0
ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy	Nov 6, 2024	Imitation LearningRobot Manipulation	—Unverified	0
Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting	Nov 5, 2024	Imitation LearningPoint Tracking	—Unverified	0
Out-of-Distribution Recovery with Object-Centric Keypoint Inverse Policy for Visuomotor Imitation Learning	Nov 5, 2024	Continual LearningImitation Learning	—Unverified	0
Efficient Active Imitation Learning with Random Network Distillation	Nov 4, 2024	Imitation Learning	—Unverified	0
So You Think You Can Scale Up Autonomous Robot Data Collection?	Nov 4, 2024	Imitation LearningReinforcement Learning (RL)	—Unverified	0
GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation	Nov 2, 2024	Imitation Learning	CodeCode Available	2
Safe Imitation Learning-based Optimal Energy Storage Systems Dispatch in Distribution Networks	Nov 1, 2024	Deep Reinforcement LearningImitation Learning	—Unverified	0
State- and context-dependent robotic manipulation and grasping via uncertainty-aware imitation learning	Oct 31, 2024	Imitation LearningUncertainty Quantification	—Unverified	0
Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment	Oct 31, 2024	Imitation LearningTransfer Learning	CodeCode Available	0
EgoMimic: Scaling Imitation Learning via Egocentric Video	Oct 31, 2024	DiversityImitation Learning	CodeCode Available	2
DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning	Oct 31, 2024	Imitation Learning	—Unverified	0
3D-ViTac: Learning Fine-Grained Manipulation with Visuo-Tactile Sensing	Oct 31, 2024	Imitation Learning	—Unverified	0
SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving	Oct 30, 2024	Autonomous DrivingImitation Learning	—Unverified	0
Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation	Oct 30, 2024	Imitation LearningIncremental Learning	—Unverified	0
Keypoint Abstraction using Large Models for Object-Relative Imitation Learning	Oct 30, 2024	Imitation LearningObject	—Unverified	0
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning	Oct 29, 2024	Imitation LearningReinforcement Learning (RL)	—Unverified	0
Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding	Oct 28, 2024	Imitation LearningMulti-Agent Path Finding	—Unverified	0
Identifying Selections for Unsupervised Subtask Discovery	Oct 28, 2024	Imitation Learning	—Unverified	0
Unveiling the Role of Expert Guidance: A Comparative Analysis of User-centered Imitation Learning and Traditional Reinforcement Learning	Oct 28, 2024	Imitation LearningUnity	—Unverified	0
GHIL-Glue: Hierarchical Control with Filtered Subgoal Images	Oct 26, 2024	Imitation LearningVideo Prediction	—Unverified	0
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization	Oct 25, 2024	Imitation Learning	CodeCode Available	2
MILES: Making Imitation Learning Easy with Self-Supervision	Oct 25, 2024	Contact-rich ManipulationImitation Learning	—Unverified	0
SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment	Oct 24, 2024	Imitation LearningMotion Planning	—Unverified	0
SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation	Oct 23, 2024	Imitation LearningMotion Planning	—Unverified	0
Reinforced Imitative Trajectory Planning for Urban Automated Driving	Oct 21, 2024	Imitation Learningreinforcement-learning	CodeCode Available	1
Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning	Oct 21, 2024	Imitation Learning	—Unverified	0
Reward-free World Models for Online Imitation Learning	Oct 17, 2024	Imitation LearningQ-Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 7 of 43Next →

No leaderboard results yet.