
Illinois Cross-Lingual Wikifier: Grounding Entities in Many Languages to the English Wikipedia

2016-12-01 · COLING 2016

Chen-Tse Tsai, Dan Roth


Abstract

We release a cross-lingual wikification system for all languages in Wikipedia. Given a piece of text in any supported language, the system identifies names of people, locations, and organizations, and grounds these names to the corresponding English Wikipedia entries. The system is based on two components: a cross-lingual named entity recognition (NER) model and a cross-lingual mention grounding model. The cross-lingual NER model is language-independent and can extract named entity mentions from text in any language in Wikipedia. The extracted mentions are then grounded to the English Wikipedia using the cross-lingual mention grounding model. The only resources required to train the proposed system are the multilingual Wikipedia dump and existing training data for English NER. The system is online at http://cogcomp.cs.illinois.edu/page/demo_view/xl_wikifier
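
The two-stage design described in the abstract (recognize mentions, then ground each one to an English Wikipedia title) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the names `Mention`, `recognize`, `ground`, and `wikify` are hypothetical, and the two toy dictionaries stand in for the learned NER and grounding models. It is not the released system or its API.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Mention:
    surface: str                 # mention text as it appears in the input
    start: int                   # character offset where the mention begins
    end: int                     # character offset just past the mention
    label: str                   # entity type, e.g. "PER", "LOC", "ORG"
    title: Optional[str] = None  # English Wikipedia title, filled in by ground()


# Hypothetical toy resources standing in for the trained models.
TOY_GAZETTEER = {"Múnich": "LOC", "Alemania": "LOC"}
TOY_TITLE_MAP = {"Múnich": "Munich", "Alemania": "Germany"}


def recognize(text: str) -> List[Mention]:
    """Stage 1 (stand-in for cross-lingual NER): find known names in the text."""
    mentions = []
    for surface, label in TOY_GAZETTEER.items():
        start = text.find(surface)
        if start != -1:
            mentions.append(Mention(surface, start, start + len(surface), label))
    return sorted(mentions, key=lambda m: m.start)


def ground(mentions: List[Mention]) -> List[Mention]:
    """Stage 2 (stand-in for mention grounding): attach an English title, or None."""
    for m in mentions:
        m.title = TOY_TITLE_MAP.get(m.surface)
    return mentions


def wikify(text: str) -> List[Mention]:
    """Run both stages in sequence, mirroring the pipeline in the abstract."""
    return ground(recognize(text))


result = wikify("Múnich está en Alemania.")
for m in result:
    print(m.surface, m.label, m.title)
```

Keeping the two stages as separate functions mirrors the abstract's split between the NER model and the grounding model, so either stand-in could be replaced without touching the other.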
