Blending Autonomous Exploration and Apprenticeship Learning

2011-12-01 · NeurIPS 2011

Thomas J. Walsh, Daniel K. Hewlett, Clayton T. Morrison

Abstract

We present theoretical and empirical results for a framework that combines the benefits of apprenticeship and autonomous reinforcement learning. Our approach modifies an existing apprenticeship learning framework that relies on teacher demonstrations and does not necessarily explore the environment. The first change is replacing previously used Mistake Bound model learners with a recently proposed framework that melds the KWIK and Mistake Bound supervised learning protocols. The second change is introducing a communication of expected utility from the student to the teacher. The resulting system only uses teacher traces when the agent needs to learn concepts it cannot efficiently learn on its own.
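To make the idea of melding the KWIK and Mistake Bound protocols more concrete, the sketch below shows one simple way a supervised learner could mix the two behaviours: it answers "I don't know" (here `None`) when its remaining hypotheses disagree strongly, and otherwise commits to a majority prediction that may turn out to be a mistake. This is an illustration only, not the algorithm from the paper. The class name `BlendedLearner`, the parameter `abstain_threshold`, the threshold hypothesis class, and the function `run_demo` are all assumptions introduced for this example, and the sketch assumes the target concept is in the hypothesis class.

```python
from typing import Callable, List, Optional

Hypothesis = Callable[[int], bool]


class BlendedLearner:
    """Toy version-space learner that may abstain or risk a mistake.

    Hypothetical illustration: abstain (KWIK-style) when the minority vote
    is large, otherwise predict the majority label (Mistake Bound style).
    Assumes the true concept is one of the supplied hypotheses.
    """

    def __init__(self, hypotheses: List[Hypothesis], abstain_threshold: float = 0.25):
        self.version_space = list(hypotheses)
        self.abstain_threshold = abstain_threshold  # assumed tuning knob
        self.mistakes = 0
        self.abstentions = 0

    def predict(self, x: int) -> Optional[bool]:
        total = len(self.version_space)
        votes_true = sum(1 for h in self.version_space if h(x))
        minority = min(votes_true, total - votes_true)
        if minority / total > self.abstain_threshold:
            # Strong disagreement: admit ignorance instead of guessing.
            self.abstentions += 1
            return None
        # Weak disagreement: commit to the majority label.
        return 2 * votes_true >= total

    def update(self, x: int, label: bool, prediction: Optional[bool]) -> None:
        if prediction is not None and prediction != label:
            self.mistakes += 1
        # Keep only the hypotheses consistent with the observed label.
        self.version_space = [h for h in self.version_space if h(x) == label]


def run_demo() -> BlendedLearner:
    # Hypothesis class: threshold functions x >= t for t in 0..8 (assumed).
    hypotheses = [lambda x, t=t: x >= t for t in range(9)]
    target = hypotheses[5]
    learner = BlendedLearner(hypotheses)
    for x in range(8):
        prediction = learner.predict(x)
        learner.update(x, target(x), prediction)
    return learner


if __name__ == "__main__":
    result = run_demo()
    print("mistakes:", result.mistakes, "abstentions:", result.abstentions)
```

Raising `abstain_threshold` toward 0.5 makes the learner behave more like a pure Mistake Bound learner (it almost always predicts), while lowering it toward 0 makes it behave more like a pure KWIK learner (it abstains whenever any hypotheses disagree).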
