FLERT: Document-Level Features for Named Entity Recognition
Stefan Schweter, Alan Akbik
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/flairNLP/flairOfficialpytorch★ 14,353
Abstract
Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NER architectures commonly considered in the literature, namely "fine-tuning" and "feature-based LSTM-CRF". We evaluate different hyperparameters for document-level features such as context window size and enforcing document-locality. We present experiments from which we derive recommendations for how to model document context and present new state-of-the-art scores on several CoNLL-03 benchmark datasets. Our approach is integrated into the Flair framework to facilitate reproduction of our experiments.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| CoNLL 2002 (Dutch) | FLERT XLM-R | F1 | 95.21 | — | Unverified |
| CoNLL 2002 (Spanish) | FLERT XLM-R | F1 | 90.14 | — | Unverified |
| CoNLL 2003 (English) | FLERT XLM-R | F1 | 94.09 | — | Unverified |
| CoNLL 2003 (German) | FLERT XLM-R | F1 | 88.34 | — | Unverified |
| CoNLL 2003 (German) Revised | FLERT XLM-R | F1 | 92.23 | — | Unverified |
| FindVehicle | FLERT | F1 Score | 80.9 | — | Unverified |