SOTAVerified

Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Title	Date	Tasks	Status
End-to-End Steering for Autonomous Vehicles via Conditional Imitation Co-Learning	Nov 25, 2024	Autonomous DrivingAutonomous Vehicles	—Unverified
Energy-Based Sequence GANs for Recommendation and Their Connection to Imitation Learning	Jun 28, 2017	Imitation LearningRecommendation Systems	—Unverified
EnerVerse-AC: Envisioning Embodied Environments with Action Condition	May 14, 2025	Image GenerationImitation Learning	—Unverified
Enhanced DACER Algorithm with High Diffusion Efficiency	May 29, 2025	DenoisingImitation Learning	—Unverified
Enhanced Generalization through Prioritization and Diversity in Self-Imitation Reinforcement Learning over Procedural Environments with Sparse Rewards	Nov 1, 2023	Decision MakingDiversity	—Unverified
Enhancing Autonomous Driving Safety with Collision Scenario Integration	Mar 5, 2025	Autonomous DrivingCollision Avoidance	—Unverified
Enhancing Reusability of Learned Skills for Robot Manipulation via Gaze and Bottleneck	Feb 25, 2025	Imitation LearningObject	—Unverified
Enhancing Spectrum Efficiency in 6G Satellite Networks: A GAIL-Powered Policy Learning via Asynchronous Federated Inverse Reinforcement Learning	Sep 27, 2024	Federated LearningImitation Learning	—Unverified
EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning	Jul 22, 2018	Imitation LearningMuJoCo	—Unverified
Entity-Centric Coreference Resolution with Model Stacking	Jul 1, 2015	coreference-resolutionCoreference Resolution	—Unverified

Title

Status

Hype

End-to-End Steering for Autonomous Vehicles via Conditional Imitation Co-Learning

—Unverified

Energy-Based Sequence GANs for Recommendation and Their Connection to Imitation Learning

—Unverified

EnerVerse-AC: Envisioning Embodied Environments with Action Condition

—Unverified

Enhanced DACER Algorithm with High Diffusion Efficiency

—Unverified

Enhanced Generalization through Prioritization and Diversity in Self-Imitation Reinforcement Learning over Procedural Environments with Sparse Rewards