SOTAVerified

Global Entity Disambiguation with BERT

2019-09-01NAACL 2022Code Available1· sign in to hype

Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We propose a global entity disambiguation (ED) model based on BERT. To capture global contextual information for ED, our model treats not only words but also entities as input tokens, and solves the task by sequentially resolving mentions to their referent entities and using resolved entities as inputs at each step. We train the model using a large entity-annotated corpus obtained from Wikipedia. We achieve new state-of-the-art results on five standard ED datasets: AIDA-CoNLL, MSNBC, AQUAINT, ACE2004, and WNED-WIKI. The source code and model checkpoint are available at https://github.com/studio-ousia/luke.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
ACE2004confidence-orderMicro-F191.9Unverified
AIDA-CoNLLconfidence-orderIn-KB Accuracy95Unverified
AQUAINTconfidence-orderMicro-F193.5Unverified
MSNBCconfidence-orderMicro-F196.3Unverified
WNED-CWEBconfidence-orderMicro-F178.9Unverified
WNED-CWEBMEPMicro-F176.2Unverified
WNED-WIKIMEPMicro-F186.2Unverified
WNED-WIKIconfidence-orderMicro-F189.1Unverified

Reproductions