Reconstructing NER Corpora: a Case Study on Bulgarian
2020-05-01LREC 2020Unverified0· sign in to hype
Iva Marinova, Laska Laskova, Petya Osenova, Kiril Simov, Alex Popov, er
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
The paper reports on the usage of deep learning methods for improving a Named Entity Recognition (NER) training corpus and for predicting and annotating new types in a test corpus. We show how the annotations in a type-based corpus of named entities (NE) were populated as occurrences within it, thus ensuring density of the training information. A deep learning model was adopted for discovering inconsistencies in the initial annotation and for learning new NE types. The evaluation results get improved after data curation, randomization and deduplication.