SOTAVerified

Named Entity Recognition (NER)

Named Entity Recognition (NER) is a task of Natural Language Processing (NLP) that involves identifying and classifying named entities in a text into predefined categories such as person names, organizations, locations, and others. The goal of NER is to extract structured information from unstructured text data and represent it in a machine-readable format. Approaches typically use BIO notation, which differentiates the beginning (B) and the inside (I) of entities. O is used for non-entity tokens.

Example:

| Mark | Watney | visited | Mars | | --- | ---| --- | --- | | B-PER | I-PER | O | B-LOC |

( Image credit: Zalando )

Papers

Showing 150 of 2874 papers

TitleStatusHype
Do "English" Named Entity Recognizers Work Well on Global Englishes?Code5
WhisperNER: Unified Open Named Entity and Speech RecognitionCode3
A Survey of Large Language Models in Finance (FinLLMs)Code3
Accurate clinical and biomedical Named entity recognition at scaleCode3
N-LTP: An Open-source Neural Language Technology Platform for ChineseCode3
Biomedical and Clinical English Model Packages in the Stanza Python NLP LibraryCode3
TENER: Adapting Transformer Encoder for Named Entity RecognitionCode3
Ludwig: a type-based declarative deep learning toolboxCode3
ERNIE 2.0: A Continual Pre-training Framework for Language UnderstandingCode3
Pre-Training with Whole Word Masking for Chinese BERTCode3
ERNIE: Enhanced Representation through Knowledge IntegrationCode3
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingCode3
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and ChallengesCode2
Rethinking Negative Instances for Generative Named Entity RecognitionCode2
TechGPT-2.0: A large language model project to solve the task of knowledge graph constructionCode2
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional TransformerCode2
GoLLIE: Annotation Guidelines improve Zero-Shot Information-ExtractionCode2
UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity RecognitionCode2
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for FinanceCode2
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity RecognitionCode2
ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPTCode2
Foresight -- Generative Pretrained Transformer (GPT) for Modelling of Patient Timelines using EHRsCode2
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal ShiftsCode2
T-NER: An All-Round Python Library for Transformer-based Named Entity RecognitionCode2
Language Modelling with PixelsCode2
TweetNLP: Cutting-Edge Natural Language Processing for Social MediaCode2
Decoupling Knowledge from Memorization: Retrieval-augmented Prompt LearningCode2
Decomposed Meta-Learning for Few-Shot Named Entity RecognitionCode2
LinkBERT: Pretraining Language Models with Document LinksCode2
BERN2: an advanced neural biomedical named entity recognition and normalization toolCode2
Sensitive Data Detection with High-Throughput Neural Network Models for Financial InstitutionsCode2
DeBERTa: Decoding-enhanced BERT with Disentangled AttentionCode2
MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual TransferCode2
FLAT: Chinese NER Using Flat-Lattice TransformerCode2
CLUENER2020: Fine-grained Named Entity Recognition Dataset and Benchmark for ChineseCode2
Label-Guided In-Context Learning for Named Entity RecognitionCode1
GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity RecognitionCode1
Multimodal LLMs for OCR, OCR Post-Correction, and Named Entity Recognition in Historical DocumentsCode1
Small Language Model Makes an Effective Long Text ExtractorCode1
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training DataCode1
Evaluating Named Entity Recognition Using Few-Shot Prompting with Large Language ModelsCode1
Retrieval Augmented Instruction Tuning for Open NER with Large Language ModelsCode1
Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity RecognitionCode1
Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity RecognitionCode1
OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage PruningCode1
REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity LinkingCode1
CHisIEC: An Information Extraction Corpus for Ancient Chinese HistoryCode1
ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language ModelsCode1
VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language ModelsCode1
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated DataCode1
Show:102550
← PrevPage 1 of 58Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACE + document-contextF194.6Unverified
2LUKE 483MF194.3Unverified
3Co-regularized LUKEF194.22Unverified
4LUKE + SubRegWeigh (K-means)F194.2Unverified
5ASP+T5-3BF194.1Unverified
6FLERT XLM-RF194.09Unverified
7PL-MarkerF194Unverified
8CL-KLF193.85Unverified
9XLNet-GCNF193.82Unverified
10RoBERTa + SubRegWeigh (K-means)F193.81Unverified
#ModelMetricClaimedVerifiedStatus
1BERT-MRC+DSCF192.07Unverified
2PL-MarkerF191.9Unverified
3Baseline + BSF191.74Unverified
4Biaffine-NERF191.3Unverified
5BERT-MRCF191.11Unverified
6PIQNF190.96Unverified
7HGNF190.92Unverified
8Syn-LSTM + BERT (wo doc-context)F190.85Unverified
9DiffusionNERF190.66Unverified
10W2NERF190.5Unverified
#ModelMetricClaimedVerifiedStatus
1BioBERTF189.71Unverified
2SpanModel + SequenceLabelingModelF189.6Unverified
3SciFive-BaseF189.39Unverified
4BLSTM-CNN-Char (SparkNLP)F189.13Unverified
5Spark NLPF189.13Unverified
6KeBioLMF189.1Unverified
7CL-KLF188.96Unverified
8BioKMNER + BioBERTF188.77Unverified
9BioLinkBERT (large)F188.76Unverified
10CompactBioBERTF188.67Unverified
#ModelMetricClaimedVerifiedStatus
1CL-KLF160.45Unverified
2RoBERTa + SubRegWeigh (K-means)F160.29Unverified
3BERT-CRF (Replicated in AdaSeq)F159.69Unverified
4RoBERTa-BiLSTM-contextF159.61Unverified
5BERT + RegLERF158.9Unverified
6TNER -xlm-r-largeF158.5Unverified
7HGNF157.41Unverified
8ASA + RoBERTaF157.3Unverified
9BERTweetF156.5Unverified
10MINERF154.86Unverified
#ModelMetricClaimedVerifiedStatus
1Ours: cross-sentence ALBF190.9Unverified
2GoLLIEF189.6Unverified
3PromptNER [RoBERTa-large]F188.26Unverified
4PIQNF187.42Unverified
5PromptNER [BERT-large]F187.21Unverified
6DiffusionNERF186.93Unverified
7BERT-MRCF186.88Unverified
8UniNER-7BF186.69Unverified
9Locate and LabelF186.67Unverified
10BoningKnifeF185.46Unverified
#ModelMetricClaimedVerifiedStatus
1KeBioLMF182Unverified
2BLSTM-CNN-Char (SparkNLP)F181.29Unverified
3Spark NLPF181.29Unverified
4BINDERF180.3Unverified
5BioMobileBERTF180.13Unverified
6BioLinkBERT (large)F180.06Unverified
7DistilBioBERTF179.97Unverified
8CompactBioBERTF179.88Unverified
9BioDistilBERTF179.1Unverified
10PubMedBERT uncasedF179.1Unverified
#ModelMetricClaimedVerifiedStatus
1BINDERF191.9Unverified
2ConNERF191.3Unverified
3CL-L2F190.99Unverified
4aimpedF190.95Unverified
5BertForTokenClassification (Spark NLP)F190.89Unverified
6BioLinkBERT (large)F190.22Unverified
7ELECTRAMedF190.03Unverified
8Spark NLPF189.73Unverified
9BLSTM-CNN-Char (SparkNLP)F189.73Unverified
10UniNER-7BF189.34Unverified