Mongolian Named Entity Recognition System with Rich Features

2016-12-01 · COLING 2016

Weihua Wang, Feilong Bao, Guanglai Gao

Abstract

In this paper, we first build a manually annotated named entity corpus of Mongolian. We then propose three morphological processing methods and study a comprehensive set of features for Mongolian named entity recognition: syllable, lexical, context, morphological, and semantic features. We also evaluate the influence of word cluster features on the system and finally combine all features. The experimental results show that segmenting each suffix into an individual token achieves better results than deleting suffixes or using the suffixes as features. The system based on segmenting suffixes with all proposed features yields a benchmark F-measure of 84.65 on this corpus.
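
The three suffix-handling strategies compared in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes each word has already been split into a stem and a list of suffixes by some upstream step, and the function names, the `"NONE"` placeholder, and the sample strings (`stemA`, `-sfx1`, and so on) are all hypothetical.

```python
from typing import Dict, List, Tuple

# Assumed input shape: (stem, [suffixes]) pairs produced by an upstream splitter.
Word = Tuple[str, List[str]]


def segment_suffixes(words: List[Word]) -> List[str]:
    """Strategy 1: emit the stem and every suffix as separate tokens."""
    return [piece for stem, suffixes in words for piece in [stem, *suffixes]]


def delete_suffixes(words: List[Word]) -> List[str]:
    """Strategy 2: keep only the stems and drop all suffixes."""
    return [stem for stem, _ in words]


def suffixes_as_feature(words: List[Word]) -> List[Dict[str, str]]:
    """Strategy 3: keep one token per word and attach its suffixes as a feature."""
    return [
        {"token": stem, "suffix": "+".join(suffixes) or "NONE"}
        for stem, suffixes in words
    ]


# Placeholder data, not real Mongolian text.
sample = [("stemA", ["-sfx1", "-sfx2"]), ("stemB", [])]
segmented = segment_suffixes(sample)
stems_only = delete_suffixes(sample)
featured = suffixes_as_feature(sample)
```

Under the first strategy the sequence labeller sees a longer token sequence in which suffixes are positions of their own; under the other two it sees one position per word, with suffix information either discarded or carried as a side feature.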
