Introducing RONEC - the Romanian Named Entity Corpus

2020-05-01 · LREC 2020 · Code Available

Stefan Daniel Dumitrescu, Andrei-Marius Avram

Abstract

We present RONEC - the Named Entity Corpus for the Romanian language. The corpus contains over 26,000 entities in 5,000 annotated sentences, belonging to 16 distinct classes. The sentences were extracted from a copyright-free newspaper and cover several styles. This corpus is the first initiative in the Romanian language space specifically targeted at named entity recognition. It is available in BRAT and CoNLL-U Plus formats, and it is free to use and extend at github.com/dumitrescustefan/ronec.
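
As an illustration of the CoNLL-U Plus format mentioned above, here is a minimal Python sketch of reading such a file and counting entity mentions per class. It relies only on the general CoNLL-U Plus convention that the first line declares the columns (`# global.columns = ...`). The column name `EXAMPLE:NER`, the `B-` prefixed tags, and the sample sentences are assumptions made for this example and are not taken from the corpus; check the repository for the actual column names and tag scheme before using it on RONEC.

```python
from collections import Counter
from typing import Dict, Iterable, Iterator, List

Token = Dict[str, str]


def read_conllup(text: str) -> Iterator[List[Token]]:
    """Yield sentences as lists of {column_name: value} dicts."""
    columns: List[str] = []
    sentence: List[Token] = []
    for line in text.splitlines():
        if line.startswith("# global.columns ="):
            # The header line names the tab-separated columns that follow.
            columns = line.split("=", 1)[1].split()
        elif line.startswith("#"):
            continue  # other comment lines (sentence id, raw text, ...)
        elif not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
        else:
            sentence.append(dict(zip(columns, line.split("\t"))))
    if sentence:
        yield sentence


def count_entities(sentences: Iterable[List[Token]], column: str) -> Counter:
    """Count entity mentions per class, assuming tags of the form B-CLASS."""
    counts: Counter = Counter()
    for sentence in sentences:
        for token in sentence:
            tag = token.get(column, "O")
            if tag.startswith("B-"):
                counts[tag[2:]] += 1
    return counts


# Hypothetical sample: column name, tags, and sentences are illustrative only.
SAMPLE = (
    "# global.columns = ID FORM EXAMPLE:NER\n"
    "# sent_id = 1\n"
    "1\tStefan\tB-PERSON\n"
    "2\tlocuiește\tO\n"
    "3\tîn\tO\n"
    "4\tBucurești\tB-GPE\n"
    "\n"
    "# sent_id = 2\n"
    "1\tRomânia\tB-GPE\n"
    "\n"
)

if __name__ == "__main__":
    sentences = list(read_conllup(SAMPLE))
    print(len(sentences), "sentences")
    print(count_entities(sentences, "EXAMPLE:NER"))
```

To run this on a real file, read its contents with `open(path, encoding="utf-8").read()` and pass the name of the entity column declared in that file's header line.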
