
Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Demonstrations are usually presented as state-action trajectories, with each pair indicating the action taken at the visited state. To learn the behavior policy, the demonstrated actions are typically used in one of two ways. The first, known as Behavior Cloning (BC), treats each demonstrated action as the target label for its state and learns a generalized mapping from states to actions in a supervised manner. The second, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions and aims to find a reward/cost function under which the demonstrated decisions are optimal.
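
As an illustration of the BC view, the following is a minimal sketch that fits a state-to-action classifier on demonstrated (state, action) pairs. The assumption of discrete actions, the network architecture, and the optimizer settings are illustrative choices, not part of the task description above.

```python
# Minimal Behavior Cloning sketch: demonstrated actions serve as target labels
# for their states, and a policy is fit with ordinary supervised learning.
import torch
import torch.nn as nn

class BCPolicy(nn.Module):
    def __init__(self, state_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),   # logits over discrete actions
        )

    def forward(self, states: torch.Tensor) -> torch.Tensor:
        return self.net(states)

def behavior_cloning(states, actions, state_dim, n_actions, epochs=50):
    """Fit a state->action classifier on demonstrated (state, action) pairs.

    states:  FloatTensor of shape (N, state_dim)
    actions: LongTensor of shape (N,) with the demonstrated action indices
    """
    policy = BCPolicy(state_dim, n_actions)
    opt = torch.optim.Adam(policy.parameters(), lr=1e-3)
    loss_fn = nn.CrossEntropyLoss()  # demonstrated action = target label
    for _ in range(epochs):
        opt.zero_grad()
        loss = loss_fn(policy(states), actions)
        loss.backward()
        opt.step()
    return policy
```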

Finally, a newer methodology, Inverse Q-Learning, aims to learn Q-functions directly from expert data, implicitly representing the reward, under which the optimal policy is given as a Boltzmann distribution, as in soft Q-learning.
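
To make the Boltzmann-policy connection concrete, the sketch below turns a learned Q-function into action probabilities via pi(a|s) proportional to exp(Q(s,a)/alpha). The temperature alpha and the q_net placeholder are assumptions for illustration; this is not a specific Inverse Q-Learning implementation.

```python
# Sketch: recovering a Boltzmann policy from a learned Q-function,
# pi(a|s) = softmax(Q(s, .) / alpha), as in soft Q-learning.
import torch

def boltzmann_policy(q_values: torch.Tensor, alpha: float = 1.0) -> torch.Tensor:
    """Turn a vector of Q-values Q(s, .) into action probabilities."""
    return torch.softmax(q_values / alpha, dim=-1)

def sample_action(q_net, state: torch.Tensor, alpha: float = 1.0) -> int:
    """Sample an action from the Boltzmann policy induced by q_net at `state`."""
    probs = boltzmann_policy(q_net(state), alpha)
    return torch.multinomial(probs, num_samples=1).item()
```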

Source: Learning to Imitate

Papers

Showing 261-270 of 2122 papers

Title | Status | Hype
Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors | - | 0
The intrinsic motivation of reinforcement and imitation learning for sequential tasks | - | 0
Imitation Learning from Suboptimal Demonstrations via Meta-Learning An Action Ranker | Code | 0
Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking | - | 0
Decoding fairness: a reinforcement learning perspective | Code | 0
SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch | - | 0
AdaCred: Adaptive Causal Decision Transformers with Feature Crediting | - | 0
Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination | - | 0
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models | - | 0
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model | - | 0
