Simple Applications of BERT for Ad Hoc Document Retrieval
2019-03-26Code Available0· sign in to hype
Wei Yang, Haotian Zhang, Jimmy Lin
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/castorini/birchpytorch★ 0
- github.com/kasys-lab/anserini-kasysnone★ 0
Abstract
Following recent successes in applying BERT to question answering, we explore simple applications to ad hoc document retrieval. This required confronting the challenge posed by documents that are typically longer than the length of input BERT was designed to handle. We address this issue by applying inference on sentences individually, and then aggregating sentence scores to produce document scores. Experiments on TREC microblog and newswire test collections show that our approach is simple yet effective, as we report the highest average precision on these datasets by neural approaches that we are aware of.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| TREC Robust04 | BERT FT(Microblog) | MAP | 0.33 | — | Unverified |