SOTAVerified

Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Title	Date	Tasks	Status
Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control	Feb 21, 2025	Imitation Learningreinforcement-learning	—Unverified
Making Universal Policies Universal	Feb 20, 2025	Imitation LearningSequential Decision Making	CodeCode Available
MILE: Model-based Intervention Learning	Feb 19, 2025	Imitation Learningmodel	—Unverified
A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects	Feb 19, 2025	Imitation LearningPoint Tracking	—Unverified
Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning	Feb 19, 2025	Imitation Learning	—Unverified
ModSkill: Physical Character Skill Modularization	Feb 19, 2025	Imitation LearningMotion Generation	—Unverified
HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit	Feb 18, 2025	Imitation Learning	—Unverified
Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification	Feb 18, 2025	Imitation LearningPrediction	—Unverified
Integrating Reinforcement Learning, Action Model Learning, and Numeric Planning for Tackling Complex Tasks	Feb 18, 2025	Imitation LearningMinecraft	CodeCode Available
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning	Feb 18, 2025	3DGSAutonomous Driving	—Unverified

Title

Status

Hype

Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control

—Unverified

Making Universal Policies Universal

CodeCode Available

MILE: Model-based Intervention Learning

—Unverified

A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects

—Unverified

Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning