SOTAVerified

Using Enhanced Gaussian Cross-Entropy in Imitation Learning to Digging the First Diamond in Minecraft

2020-12-14CUHK Course IERG5350 2020Unverified0· sign in to hype

Yingjie Cai, Xiao Zhang

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Although state-ofthe-art reinforcement learning (RL) systems has led to breakthroughs in many difficult tasks, the sample inefficiency of standard reinforcement learning methods still precludes their application to more extremely complex tasks. Such limitation will make many reinforcement learning systems cannot be applied to real-world problem, in which environment samples are expensive. To solve this problem, MineRL (13) provide an ideal develop environment to facilitate the research that leveraging fewer human demonstrations with more efficient reinforcement learning systems. Based on the MineRL environmnet, we proposed an enhanced Gaussian cross entropy (EGCE) loss for imitation learnning problems to achieve ideal performance. In the ObtainDiamond task, our EGCE achieves about 7.7% improvement than a strong baseline imitation learning pipeline. The demo video is available at here.

Tasks

Reproductions