SOTAVerified

Named Entity Recognition (NER)

Named Entity Recognition (NER) is a task of Natural Language Processing (NLP) that involves identifying and classifying named entities in a text into predefined categories such as person names, organizations, locations, and others. The goal of NER is to extract structured information from unstructured text data and represent it in a machine-readable format. Approaches typically use BIO notation, which differentiates the beginning (B) and the inside (I) of entities. O is used for non-entity tokens.

Example:

| Mark | Watney | visited | Mars | | --- | ---| --- | --- | | B-PER | I-PER | O | B-LOC |

( Image credit: Zalando )

Papers

Showing 251275 of 2874 papers

TitleStatusHype
DeepStruct: Pretraining of Language Models for Structure PredictionCode1
Deep Span Representations for Named Entity RecognitionCode1
A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language ProcessingCode1
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4Code1
ELIT: Emory Language and Information ToolkitCode1
Domain-Specific NER via Retrieving Correlated SamplesCode1
Supplementary Features of BiLSTM for Enhanced Sequence LabelingCode1
A Simple but Effective Approach to Improve Structured Language Model Output for Information ExtractionCode1
Do Syntax Trees Help Pre-trained Transformers Extract Information?Code1
Empirical Analysis of Unlabeled Entity Problem in Named Entity RecognitionCode1
Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-TrainingCode1
AraELECTRA: Pre-Training Text Discriminators for Arabic Language UnderstandingCode1
Distantly Supervised Named Entity Recognition via Confidence-Based Multi-Class Positive and Unlabeled LearningCode1
A Sequence-to-Set Network for Nested Named Entity RecognitionCode1
Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media AnalysisCode1
Domain specific BERT representation for Named Entity Recognition of lab protocolCode1
DWIE: an entity-centric dataset for multi-task document-level information extractionCode1
Earnings-21: A Practical Benchmark for ASR in the WildCode1
Efficient Test Time Adapter Ensembling for Low-resource Language VarietiesCode1
A Span-Based Model for Joint Overlapped and Discontinuous Named Entity RecognitionCode1
AraBERT: Transformer-based Model for Arabic Language UnderstandingCode1
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control CommunicationsCode1
ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NERCode1
End-to-End Chinese Speaker IdentificationCode1
DiscoverPath: A Knowledge Refinement and Retrieval System for Interdisciplinarity on Biomedical ResearchCode1
Show:102550
← PrevPage 11 of 115Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACE + document-contextF194.6Unverified
2LUKE 483MF194.3Unverified
3Co-regularized LUKEF194.22Unverified
4LUKE + SubRegWeigh (K-means)F194.2Unverified
5ASP+T5-3BF194.1Unverified
6FLERT XLM-RF194.09Unverified
7PL-MarkerF194Unverified
8CL-KLF193.85Unverified
9XLNet-GCNF193.82Unverified
10RoBERTa + SubRegWeigh (K-means)F193.81Unverified
#ModelMetricClaimedVerifiedStatus
1BERT-MRC+DSCF192.07Unverified
2PL-MarkerF191.9Unverified
3Baseline + BSF191.74Unverified
4Biaffine-NERF191.3Unverified
5BERT-MRCF191.11Unverified
6PIQNF190.96Unverified
7HGNF190.92Unverified
8Syn-LSTM + BERT (wo doc-context)F190.85Unverified
9DiffusionNERF190.66Unverified
10W2NERF190.5Unverified
#ModelMetricClaimedVerifiedStatus
1BioBERTF189.71Unverified
2SpanModel + SequenceLabelingModelF189.6Unverified
3SciFive-BaseF189.39Unverified
4BLSTM-CNN-Char (SparkNLP)F189.13Unverified
5Spark NLPF189.13Unverified
6KeBioLMF189.1Unverified
7CL-KLF188.96Unverified
8BioKMNER + BioBERTF188.77Unverified
9BioLinkBERT (large)F188.76Unverified
10CompactBioBERTF188.67Unverified
#ModelMetricClaimedVerifiedStatus
1CL-KLF160.45Unverified
2RoBERTa + SubRegWeigh (K-means)F160.29Unverified
3BERT-CRF (Replicated in AdaSeq)F159.69Unverified
4RoBERTa-BiLSTM-contextF159.61Unverified
5BERT + RegLERF158.9Unverified
6TNER -xlm-r-largeF158.5Unverified
7HGNF157.41Unverified
8ASA + RoBERTaF157.3Unverified
9BERTweetF156.5Unverified
10MINERF154.86Unverified
#ModelMetricClaimedVerifiedStatus
1Ours: cross-sentence ALBF190.9Unverified
2GoLLIEF189.6Unverified
3PromptNER [RoBERTa-large]F188.26Unverified
4PIQNF187.42Unverified
5PromptNER [BERT-large]F187.21Unverified
6DiffusionNERF186.93Unverified
7BERT-MRCF186.88Unverified
8UniNER-7BF186.69Unverified
9Locate and LabelF186.67Unverified
10BoningKnifeF185.46Unverified
#ModelMetricClaimedVerifiedStatus
1KeBioLMF182Unverified
2BLSTM-CNN-Char (SparkNLP)F181.29Unverified
3Spark NLPF181.29Unverified
4BINDERF180.3Unverified
5BioMobileBERTF180.13Unverified
6BioLinkBERT (large)F180.06Unverified
7DistilBioBERTF179.97Unverified
8CompactBioBERTF179.88Unverified
9BioDistilBERTF179.1Unverified
10PubMedBERT uncasedF179.1Unverified
#ModelMetricClaimedVerifiedStatus
1BINDERF191.9Unverified
2ConNERF191.3Unverified
3CL-L2F190.99Unverified
4aimpedF190.95Unverified
5BertForTokenClassification (Spark NLP)F190.89Unverified
6BioLinkBERT (large)F190.22Unverified
7ELECTRAMedF190.03Unverified
8Spark NLPF189.73Unverified
9BLSTM-CNN-Char (SparkNLP)F189.73Unverified
10UniNER-7BF189.34Unverified