SOTAVerified

Language-Independent Named Entity Analysis Using Parallel Projection and Rule-Based Disambiguation

2017-04-01WS 2017Unverified0· sign in to hype

James Mayfield, Paul McNamee, Cash Costello

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

The 2017 shared task at the Balto-Slavic NLP workshop requires identifying coarse-grained named entities in seven languages, identifying each entity's base form, and clustering name mentions across the multilingual set of documents. The fact that no training data is provided to systems for building supervised classifiers further adds to the complexity. To complete the task we first use publicly available parallel texts to project named entity recognition capability from English to each evaluation language. We ignore entirely the subtask of identifying non-inflected forms of names. Finally, we create cross-document entity identifiers by clustering named mentions using a procedure-based approach.

Tasks

Reproductions