Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation

2016-01-06CONLL 2016Unverified0· sign in to hype

Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji

Unverified — Be the first to reproduce this paper.

Abstract

Named Entity Disambiguation (NED) refers to the task of resolving multiple named entity mentions in a document to their correct references in a knowledge base (KB) (e.g., Wikipedia). In this paper, we propose a novel embedding method specifically designed for NED. The proposed method jointly maps words and entities into the same continuous vector space. We extend the skip-gram model by using two models. The KB graph model learns the relatedness of entities using the link structure of the KB, whereas the anchor context model aims to align vectors such that similar words and entities occur close to one another in the vector space by leveraging KB anchors and their context words. By combining contexts based on the proposed embedding with standard NED features, we achieved state-of-the-art accuracy of 93.1% on the standard CoNLL dataset and 85.2% on the TAC 2010 dataset.

Tasks

Entity Disambiguation Entity Linking

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
AIDA-CoNLL	Wikipedia2Vec-GBRT	In-KB Accuracy	93.1	—	Unverified
AIDA-CoNLL	Wikipedia2Vec	In-KB Accuracy	91.5	—	Unverified
TAC2010	Wikipedia2Vec	Micro Precision	85.2	—	Unverified

Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation

Abstract

Tasks

Benchmark Results

Reproductions