Imitation Learning

Imitation Learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Finally, a newer methodology, Inverse Q-Learning aims at directly learning Q-functions from expert data, implicitly representing rewards, under which the optimal policy can be given as a Boltzmann distribution similar to soft Q-learning

Source: Learning to Imitate

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1051–1075 of 2122 papers

Title	Date	Tasks	Status	Hype
Learning to Structure an Image with Few Colors and Beyond	Aug 17, 2022	Image CompressionImitation Learning	—Unverified	0
Towards Informed Design and Validation Assistance in Computer Games Using Imitation Learning	Aug 15, 2022	Imitation LearningSurvey	—Unverified	0
Sequential Causal Imitation Learning with Unobserved Confounders	Aug 12, 2022	Decision MakingImitation Learning	—Unverified	0
Causal Imitation Learning with Unobserved Confounders	Aug 12, 2022	Imitation Learning	—Unverified	0
Exploring the trade off between human driving imitation and safety for traffic simulation	Aug 9, 2022	Imitation Learningreinforcement-learning	—Unverified	0
Solving the Baby Intuitions Benchmark with a Hierarchically Bayesian Theory of Mind	Aug 4, 2022	Few-Shot LearningImitation Learning	CodeCode Available	0
Sequence Model Imitation Learning with Unobserved Contexts	Aug 3, 2022	continuous-controlContinuous Control	CodeCode Available	0
Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis	Aug 3, 2022	Imitation Learning	—Unverified	0
See What the Robot Can't See: Learning Cooperative Perception for Visual Navigation	Aug 1, 2022	Graph Neural NetworkImitation Learning	CodeCode Available	0
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination	Jul 31, 2022	Imitation Learningreinforcement-learning	—Unverified	0
Improved Policy Optimization for Online Imitation Learning	Jul 29, 2022	Imitation Learning	CodeCode Available	0
Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts	Jul 24, 2022	Deep Reinforcement LearningHumanoid Control	CodeCode Available	1
Robots Enact Malignant Stereotypes	Jul 23, 2022	Bias DetectionGender Bias Detection	—Unverified	0
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)	Jul 22, 2022	Imitation LearningMachine Translation	—Unverified	0
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations	Jul 20, 2022	Imitation LearningOffline RL	CodeCode Available	1
Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction	Jul 20, 2022	Imitation LearningMuJoCo	—Unverified	0
A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation	Jul 18, 2022	Imitation LearningReinforcement Learning (RL)	—Unverified	0
Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation	Jul 18, 2022	Deep Reinforcement LearningImitation Learning	CodeCode Available	0
Learning to Prove Trigonometric Identities	Jul 14, 2022	Automated Theorem ProvingImitation Learning	—Unverified	0
Finding Fallen Objects Via Asynchronous Audio-Visual Integration	Jul 7, 2022	Imitation LearningObject	—Unverified	0
Learning to Accelerate Approximate Methods for Solving Integer Programming via Early Fixing	Jul 5, 2022	Adversarial AttackImitation Learning	CodeCode Available	0
Planning with RL and episodic-memory behavioral priors	Jul 5, 2022	Imitation LearningQ-Learning	—Unverified	0
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents	Jul 4, 2022	Decision MakingImitation Learning	CodeCode Available	2
Target-absent Human Attention	Jul 4, 2022	Imitation Learning	CodeCode Available	1
Discriminator-Guided Model-Based Offline Imitation Learning	Jul 1, 2022	Decision MakingImitation Learning	—Unverified	0

Show:10 25 50

← PrevPage 43 of 85Next →

No leaderboard results yet.