Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 776–800 of 2122 papers

Title	Date	Tasks	Status
Ranking-based Client Selection with Imitation Learning for Efficient Federated Learning	May 7, 2024	Federated LearningImitation Learning	—Unverified
Robotic Constrained Imitation Learning for the Peg Transfer Task in Fundamentals of Laparoscopic Surgery	May 6, 2024	Imitation Learning	—Unverified
VectorPainter: Advanced Stylized Vector Graphics Synthesis Using Stroke-Style Priors	May 5, 2024	Imitation LearningVector Graphics	—Unverified
Sub-goal Distillation: A Method to Improve Small Language Agents	May 4, 2024	Imitation LearningKnowledge Distillation	CodeCode Available
Imitation Learning in Discounted Linear MDPs without exploration assumptions	May 3, 2024	Imitation Learning	—Unverified
IntervenGen: Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning	May 2, 2024	Imitation LearningPose Estimation	—Unverified
Continual Learning from Simulated Interactions via Multitask Prospective Rehearsal for Bionic Limb Behavior Modeling	May 2, 2024	Continual LearningImitation Learning	—Unverified
CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning	May 2, 2024	Imitation LearningTrajectory Planning	—Unverified
Guiding Attention in End-to-End Driving Models	Apr 30, 2024	Autonomous DrivingImitation Learning	CodeCode Available
A Survey of Imitation Learning Methods, Environments and Metrics	Apr 30, 2024	Imitation LearningSurvey	—Unverified
Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models	Apr 29, 2024	Decision MakingImitation Learning	CodeCode Available
Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods	Apr 25, 2024	Imitation Learning	—Unverified
Benchmarking Mobile Device Control Agents across Diverse Configurations	Apr 25, 2024	BenchmarkingImitation Learning	—Unverified
IDIL: Imitation Learning of Intent-Driven Expert Behavior	Apr 25, 2024	Imitation Learning	—Unverified
LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots	Apr 22, 2024	Imitation LearningTask Planning	—Unverified
A survey of air combat behavior modeling using machine learning	Apr 22, 2024	Imitation LearningSurvey	—Unverified
Augmenting Safety-Critical Driving Scenarios while Preserving Similarity to Expert Trajectories	Apr 20, 2024	Imitation Learning	—Unverified
Bootstrapping Linear Models for Fast Online Adaptation in Human-Agent Collaboration	Apr 16, 2024	Human Agent CollaborationImitation Learning	CodeCode Available
Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model	Apr 15, 2024	Imitation LearningLanguage Modeling	—Unverified
Adversarial Imitation Learning via Boosting	Apr 12, 2024	Imitation Learning	—Unverified
AdaDemo: Data-Efficient Demonstration Expansion for Generalist Robotic Agent	Apr 11, 2024	Imitation Learning	—Unverified
Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery	Apr 10, 2024	Decision MakingImitation Learning	—Unverified
SAFE-GIL: SAFEty Guided Imitation Learning for Robotic Systems	Apr 8, 2024	Autonomous NavigationImitation Learning	—Unverified
CNN-based Game State Detection for a Foosball Table	Apr 8, 2024	Deep Reinforcement LearningImitation Learning	—Unverified
Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs	Apr 7, 2024	Autonomous DrivingImitation Learning	—Unverified

Show:10 25 50

← PrevPage 32 of 85Next →

No leaderboard results yet.