Named Entity Recognition (NER)
Named Entity Recognition (NER) is a task of Natural Language Processing (NLP) that involves identifying and classifying named entities in a text into predefined categories such as person names, organizations, locations, and others. The goal of NER is to extract structured information from unstructured text data and represent it in a machine-readable format. Approaches typically use BIO notation, which differentiates the beginning (B) and the inside (I) of entities. O is used for non-entity tokens.
Example:
| Mark | Watney | visited | Mars | | --- | ---| --- | --- | | B-PER | I-PER | O | B-LOC |
( Image credit: Zalando )
Papers
Showing 1–10 of 2874 papers
All datasetsCoNLL 2003 (English)Ontonotes v5 (English)NCBI DiseaseWNUT 2017ACE 2005JNLPBABC5CDRGENIABC2GMBC5CDR-chemicalSLUECoNLL++
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | W2V2-L-LL60K (pipeline approach, uses LM) | F1 (%) | 69.6 | — | Unverified |
| 2 | W2V2-B-LS960 (pipeline approach, uses LM) | F1 (%) | 68 | — | Unverified |
| 3 | Wav2Seq (from HuBERT-large) | F1 (%) | 65.4 | — | Unverified |
| 4 | W2V2-L-LL60K (e2e approach, uses LM) | F1 (%) | 64.8 | — | Unverified |
| 5 | W2V2-B-LS960 (e2e approach, uses LM) | F1 (%) | 63.4 | — | Unverified |
| 6 | HuBERT-B-LS960 (e2e approach, uses LM) | F1 (%) | 61.9 | — | Unverified |
| 7 | W2V2-B-VP100K (e2e approach, uses LM) | F1 (%) | 61.8 | — | Unverified |
| 8 | W2V2-L-LL60K (pipeline approach) | F1 (%) | 57.8 | — | Unverified |
| 9 | W2V2-L-LL60K (e2e approach) | F1 (%) | 50.9 | — | Unverified |
| 10 | W2V2-B-LS960 (e2e approach) | F1 (%) | 50.2 | — | Unverified |