Learning Dense Representations of Phrases at Scale

2020-12-23ACL 2021Code Available2· sign in to hype

Jinhyuk Lee, Mujeen Sung, Jaewoo Kang, Danqi Chen

Code Available — Be the first to reproduce this paper.

Code

github.com/princeton-nlp/DensePhrases
OfficialIn paperpytorch★ 606
github.com/jhyuklee/DensePhrases
OfficialIn paperpytorch★ 606
github.com/princeton-nlp/SimCSE
pytorch★ 3,646
github.com/dmis-lab/gener
pytorch★ 76

Abstract

Open-domain question answering can be reformulated as a phrase retrieval problem, without the need for processing documents on-demand during inference (Seo et al., 2019). However, current phrase retrieval models heavily depend on sparse representations and still underperform retriever-reader approaches. In this work, we show for the first time that we can learn dense representations of phrases alone that achieve much stronger performance in open-domain QA. We present an effective method to learn phrase representations from the supervision of reading comprehension tasks, coupled with novel negative sampling methods. We also propose a query-side fine-tuning strategy, which can support transfer learning and reduce the discrepancy between training and inference. On five popular open-domain QA datasets, our model DensePhrases improves over previous phrase retrieval models by 15%-25% absolute accuracy and matches the performance of state-of-the-art retriever-reader models. Our model is easy to parallelize due to pure dense representations and processes more than 10 questions per second on CPUs. Finally, we directly use our pre-indexed dense phrase representations for two slot filling tasks, showing the promise of utilizing DensePhrases as a dense knowledge base for downstream tasks.

Tasks

Open-Domain Question Answering Question Answering Question Generation Reading Comprehension Retrieval Slot Filling Transfer Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Natural Questions (long)	DensePhrases	F1	79.6	—	Unverified
SQuAD1.1 dev	DensePhrases	EM	78.3	—	Unverified

Learning Dense Representations of Phrases at Scale

Code

Abstract

Tasks

Benchmark Results

Reproductions