Simple Applications of BERT for Ad Hoc Document Retrieval

2019-03-26Code Available0· sign in to hype

Wei Yang, Haotian Zhang, Jimmy Lin

Code Available — Be the first to reproduce this paper.

Code

github.com/castorini/birch
pytorch★ 143
github.com/kasys-lab/anserini-kasys
none★ 0

Abstract

Following recent successes in applying BERT to question answering, we explore simple applications to ad hoc document retrieval. This required confronting the challenge posed by documents that are typically longer than the length of input BERT was designed to handle. We address this issue by applying inference on sentences individually, and then aggregating sentence scores to produce document scores. Experiments on TREC microblog and newswire test collections show that our approach is simple yet effective, as we report the highest average precision on these datasets by neural approaches that we are aware of.

Tasks

Ad-Hoc Information Retrieval Question Answering Retrieval Sentence

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
TREC Robust04	BERT FT(Microblog)	MAP	0.33	—	Unverified

Simple Applications of BERT for Ad Hoc Document Retrieval

Code

Abstract

Tasks

Benchmark Results

Reproductions