SOTAVerified

Named Entity Recognition (NER)

Named Entity Recognition (NER) is a task of Natural Language Processing (NLP) that involves identifying and classifying named entities in a text into predefined categories such as person names, organizations, locations, and others. The goal of NER is to extract structured information from unstructured text data and represent it in a machine-readable format. Approaches typically use BIO notation, which differentiates the beginning (B) and the inside (I) of entities. O is used for non-entity tokens.

Example:

| Mark | Watney | visited | Mars | | --- | ---| --- | --- | | B-PER | I-PER | O | B-LOC |

( Image credit: Zalando )

Papers

Showing 150 of 2874 papers

TitleStatusHype
Do "English" Named Entity Recognizers Work Well on Global Englishes?Code5
Biomedical and Clinical English Model Packages in the Stanza Python NLP LibraryCode3
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingCode3
TENER: Adapting Transformer Encoder for Named Entity RecognitionCode3
ERNIE 2.0: A Continual Pre-training Framework for Language UnderstandingCode3
Accurate clinical and biomedical Named entity recognition at scaleCode3
Ludwig: a type-based declarative deep learning toolboxCode3
A Survey of Large Language Models in Finance (FinLLMs)Code3
WhisperNER: Unified Open Named Entity and Speech RecognitionCode3
N-LTP: An Open-source Neural Language Technology Platform for ChineseCode3
Pre-Training with Whole Word Masking for Chinese BERTCode3
ERNIE: Enhanced Representation through Knowledge IntegrationCode3
GoLLIE: Annotation Guidelines improve Zero-Shot Information-ExtractionCode2
MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual TransferCode2
Language Modelling with PixelsCode2
Sensitive Data Detection with High-Throughput Neural Network Models for Financial InstitutionsCode2
ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPTCode2
UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity RecognitionCode2
Decoupling Knowledge from Memorization: Retrieval-augmented Prompt LearningCode2
TechGPT-2.0: A large language model project to solve the task of knowledge graph constructionCode2
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for FinanceCode2
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and ChallengesCode2
LinkBERT: Pretraining Language Models with Document LinksCode2
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal ShiftsCode2
T-NER: An All-Round Python Library for Transformer-based Named Entity RecognitionCode2
Rethinking Negative Instances for Generative Named Entity RecognitionCode2
FLAT: Chinese NER Using Flat-Lattice TransformerCode2
Decomposed Meta-Learning for Few-Shot Named Entity RecognitionCode2
Foresight -- Generative Pretrained Transformer (GPT) for Modelling of Patient Timelines using EHRsCode2
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity RecognitionCode2
BERN2: an advanced neural biomedical named entity recognition and normalization toolCode2
DeBERTa: Decoding-enhanced BERT with Disentangled AttentionCode2
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional TransformerCode2
CLUENER2020: Fine-grained Named Entity Recognition Dataset and Benchmark for ChineseCode2
TweetNLP: Cutting-Edge Natural Language Processing for Social MediaCode2
A Span-Based Model for Joint Overlapped and Discontinuous Named Entity RecognitionCode1
A Simple but Effective Approach to Improve Structured Language Model Output for Information ExtractionCode1
A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and DatasetsCode1
Can images help recognize entities? A study of the role of images for Multimodal NERCode1
A Sequence-to-Set Network for Nested Named Entity RecognitionCode1
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control CommunicationsCode1
AraBERT: Transformer-based Model for Arabic Language UnderstandingCode1
Application of Deep Learning in Generating Structured Radiology Reports: A Transformer-Based TechniqueCode1
AraELECTRA: Pre-Training Text Discriminators for Arabic Language UnderstandingCode1
An Incremental Parser for Abstract Meaning RepresentationCode1
An Enhanced Span-based Decomposition Method for Few-Shot Sequence LabelingCode1
An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and NormalizationCode1
A Neural Span-Based Continual Named Entity Recognition ModelCode1
A Neural Transition-based Model for Nested Mention RecognitionCode1
Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media AnalysisCode1
Show:102550
← PrevPage 1 of 58Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACE + document-contextF194.6Unverified
2LUKE 483MF194.3Unverified
3Co-regularized LUKEF194.22Unverified
4LUKE + SubRegWeigh (K-means)F194.2Unverified
5ASP+T5-3BF194.1Unverified
6FLERT XLM-RF194.09Unverified
7PL-MarkerF194Unverified
8CL-KLF193.85Unverified
9XLNet-GCNF193.82Unverified
10RoBERTa + SubRegWeigh (K-means)F193.81Unverified
#ModelMetricClaimedVerifiedStatus
1BERT-MRC+DSCF192.07Unverified
2PL-MarkerF191.9Unverified
3Baseline + BSF191.74Unverified
4Biaffine-NERF191.3Unverified
5BERT-MRCF191.11Unverified
6PIQNF190.96Unverified
7HGNF190.92Unverified
8Syn-LSTM + BERT (wo doc-context)F190.85Unverified
9DiffusionNERF190.66Unverified
10W2NERF190.5Unverified
#ModelMetricClaimedVerifiedStatus
1BioBERTF189.71Unverified
2SpanModel + SequenceLabelingModelF189.6Unverified
3SciFive-BaseF189.39Unverified
4Spark NLPF189.13Unverified
5BLSTM-CNN-Char (SparkNLP)F189.13Unverified
6KeBioLMF189.1Unverified
7CL-KLF188.96Unverified
8BioKMNER + BioBERTF188.77Unverified
9BioLinkBERT (large)F188.76Unverified
10CompactBioBERTF188.67Unverified
#ModelMetricClaimedVerifiedStatus
1CL-KLF160.45Unverified
2RoBERTa + SubRegWeigh (K-means)F160.29Unverified
3BERT-CRF (Replicated in AdaSeq)F159.69Unverified
4RoBERTa-BiLSTM-contextF159.61Unverified
5BERT + RegLERF158.9Unverified
6TNER -xlm-r-largeF158.5Unverified
7HGNF157.41Unverified
8ASA + RoBERTaF157.3Unverified
9BERTweetF156.5Unverified
10MINERF154.86Unverified
#ModelMetricClaimedVerifiedStatus
1Ours: cross-sentence ALBF190.9Unverified
2GoLLIEF189.6Unverified
3PromptNER [RoBERTa-large]F188.26Unverified
4PIQNF187.42Unverified
5PromptNER [BERT-large]F187.21Unverified
6DiffusionNERF186.93Unverified
7BERT-MRCF186.88Unverified
8UniNER-7BF186.69Unverified
9Locate and LabelF186.67Unverified
10BoningKnifeF185.46Unverified
#ModelMetricClaimedVerifiedStatus
1KeBioLMF182Unverified
2BLSTM-CNN-Char (SparkNLP)F181.29Unverified
3Spark NLPF181.29Unverified
4BINDERF180.3Unverified
5BioMobileBERTF180.13Unverified
6BioLinkBERT (large)F180.06Unverified
7DistilBioBERTF179.97Unverified
8CompactBioBERTF179.88Unverified
9BioDistilBERTF179.1Unverified
10PubMedBERT uncasedF179.1Unverified
#ModelMetricClaimedVerifiedStatus
1BINDERF191.9Unverified
2ConNERF191.3Unverified
3CL-L2F190.99Unverified
4aimpedF190.95Unverified
5BertForTokenClassification (Spark NLP)F190.89Unverified
6BioLinkBERT (large)F190.22Unverified
7ELECTRAMedF190.03Unverified
8BLSTM-CNN-Char (SparkNLP)F189.73Unverified
9Spark NLPF189.73Unverified
10UniNER-7BF189.34Unverified