SOTAVerified

FLERT: Document-Level Features for Named Entity Recognition

2020-11-13Code Available0· sign in to hype

Stefan Schweter, Alan Akbik

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NER architectures commonly considered in the literature, namely "fine-tuning" and "feature-based LSTM-CRF". We evaluate different hyperparameters for document-level features such as context window size and enforcing document-locality. We present experiments from which we derive recommendations for how to model document context and present new state-of-the-art scores on several CoNLL-03 benchmark datasets. Our approach is integrated into the Flair framework to facilitate reproduction of our experiments.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
CoNLL 2002 (Dutch)FLERT XLM-RF195.21Unverified
CoNLL 2002 (Spanish)FLERT XLM-RF190.14Unverified
CoNLL 2003 (English)FLERT XLM-RF194.09Unverified
CoNLL 2003 (German)FLERT XLM-RF188.34Unverified
CoNLL 2003 (German) RevisedFLERT XLM-RF192.23Unverified
FindVehicleFLERTF1 Score80.9Unverified

Reproductions