Distilling Knowledge from Reader to Retriever for Question Answering
Gautier Izacard, Edouard Grave
Code
- github.com/facebookresearch/FiD (official, PyTorch, ★ 592)
- github.com/lucidrains/marge-pytorch (PyTorch, ★ 76)
- github.com/hackerchenzhuo/LaKo (PyTorch, ★ 25)
- github.com/FenQQQ/Fusion-in-decoder (PyTorch, ★ 1)
Abstract
The task of information retrieval is an important component of many natural language processing systems, such as open domain question answering. While traditional methods were based on hand-crafted features, continuous representations based on neural networks have recently obtained competitive results. A challenge of using such methods is obtaining supervised data to train the retriever model, corresponding to pairs of query and support documents. In this paper, we propose a technique to learn retriever models for downstream tasks, inspired by knowledge distillation, which does not require annotated pairs of queries and documents. Our approach leverages attention scores of a reader model, used to solve the task based on retrieved documents, to obtain synthetic labels for the retriever. We evaluate our method on question answering, obtaining state-of-the-art results.
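The objective described in the abstract can be sketched as follows. This is not the authors' code: it assumes the reader's cross-attention scores have already been aggregated into a single relevance score per retrieved passage (the aggregation scheme and the hypothetical score values below are illustrative). The reader-derived scores are turned into a soft target distribution, and the retriever is trained to match it by minimizing a KL divergence:

```python
import numpy as np

def softmax(x, temp=1.0):
    """Numerically stable softmax over a vector of scores."""
    z = np.asarray(x, dtype=float) / temp
    z -= z.max()  # shift for stability; does not change the result
    e = np.exp(z)
    return e / e.sum()

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) between two discrete distributions."""
    p, q = np.asarray(p), np.asarray(q)
    return float(np.sum(p * (np.log(p + eps) - np.log(q + eps))))

# Hypothetical aggregated reader cross-attention scores, one per passage.
reader_attention = [4.2, 1.1, 0.3, 2.5]
# Hypothetical retriever scores (e.g. query-passage dot products) for the
# same passages.
retriever_scores = [3.0, 2.0, 0.5, 1.0]

# Synthetic labels from the reader, and the retriever's own distribution.
target = softmax(reader_attention)
pred = softmax(retriever_scores)

# Distillation loss: the retriever is trained to minimize this quantity,
# pulling its ranking toward the passages the reader attended to.
loss = kl_divergence(target, pred)
```

In training, the gradient of `loss` would flow only into the retriever's scores; the reader's attention is treated as a fixed teacher signal for that step.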
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| NarrativeQA | FiD+Distil | ROUGE-L | 32 | — | Unverified |
| TriviaQA | FiD+Distil | EM | 72.1 | — | Unverified |