Cross-lingual Extended Named Entity Classification of Wikipedia Articles

2020-10-07Unverified0· sign in to hype

The Viet Bui, Phuong Le-Hong

Unverified — Be the first to reproduce this paper.

Abstract

The FPT.AI team participated in the SHINRA2020-ML subtask of the NTCIR-15 SHINRA task. This paper describes our method to solving the problem and discusses the official results. Our method focuses on learning cross-lingual representations, both on the word level and document level for page classification. We propose a three-stage approach including multilingual model pre-training, monolingual model fine-tuning and cross-lingual voting. Our system is able to achieve the best scores for 25 out of 30 languages; and its accuracy gaps to the best performing systems of the other five languages are relatively small.

Tasks

Articles Classification General Classification

Cross-lingual Extended Named Entity Classification of Wikipedia Articles

Abstract

Tasks

Reproductions