SOTAVerified

Named Entity Recognition (NER)

Named Entity Recognition (NER) is a Natural Language Processing (NLP) task that identifies named entities in text and classifies them into predefined categories such as person names, organizations, and locations. The goal of NER is to extract structured information from unstructured text and represent it in a machine-readable format. Approaches typically use BIO notation, which marks the beginning (B) and the inside (I) of each entity; O marks non-entity tokens.

Example:

| Mark | Watney | visited | Mars |
| --- | --- | --- | --- |
| B-PER | I-PER | O | B-LOC |
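The BIO example above can be turned back into entity spans with a short decoding loop. The following is a minimal sketch in Python, not taken from any of the listed papers: the function name `bio_to_spans` is hypothetical, and dropping an `I-` tag that does not continue an open entity of the same type is an assumption made here for simplicity (other decoders treat such a tag as the start of a new entity).

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (entity_text, entity_type) pairs.

    Hypothetical helper for illustration only.
    """
    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Close the entity that is currently open, if any.
        nonlocal current_tokens, current_type
        if current_tokens and current_type is not None:
            spans.append((" ".join(current_tokens), current_type))
        current_tokens, current_type = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always starts a new entity.
            flush()
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            # An I- tag of the same type extends the open entity.
            current_tokens.append(token)
        else:
            # "O", or an I- tag that does not continue the open entity
            # (assumption: such a stray token is treated as non-entity).
            flush()
    flush()
    return spans


tokens = ["Mark", "Watney", "visited", "Mars"]
tags = ["B-PER", "I-PER", "O", "B-LOC"]
print(bio_to_spans(tokens, tags))
# [('Mark Watney', 'PER'), ('Mars', 'LOC')]
```

Grouping tokens into spans this way is what makes entity-level comparison possible, since a predicted entity can then be matched against a gold entity as a whole rather than token by token.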

(Image credit: Zalando)

Papers

Showing 101-150 of 2874 papers

| Title | Status | Hype |
| --- | --- | --- |
| TOE: A Grid-Tagging Discontinuous NER Model Enhanced by Embedding Tag/Word Relations and More Fine-Grained Tags | Code | 1 |
| Autoregressive Structured Prediction with Language Models | Code | 1 |
| Unsupervised Text Deidentification | Code | 1 |
| Multi-Granularity Cross-Modality Representation Learning for Named Entity Recognition on Social Media | Code | 1 |
| End-to-End Entity Detection with Proposer and Regressor | Code | 1 |
| KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents | Code | 1 |
| Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition | Code | 1 |
| HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient Korea | Code | 1 |
| SEE-Few: Seed, Expand and Entail for Few-shot Named Entity Recognition | Code | 1 |
| Deep Span Representations for Named Entity Recognition | Code | 1 |
| Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition | Code | 1 |
| Distillation-Resistant Watermarking for Model Protection in NLP | Code | 1 |
| COPNER: Contrastive Learning with Prompt Guiding for Few-shot Named Entity Recognition | Code | 1 |
| METS-CoV: A Dataset of Medical Entity and Targeted Sentiment on COVID-19 Related Tweets | Code | 1 |
| A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing | Code | 1 |
| Application of Deep Learning in Generating Structured Radiology Reports: A Transformer-Based Technique | Code | 1 |
| On the Effectiveness of Compact Biomedical Transformers | Code | 1 |
| SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER | Code | 1 |
| KoCHET: a Korean Cultural Heritage corpus for Entity-related Tasks | Code | 1 |
| Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning | Code | 1 |
| Domain-Specific NER via Retrieving Correlated Samples | Code | 1 |
| FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition | Code | 1 |
| An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition | Code | 1 |
| Good Visual Guidance Make A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction | Code | 1 |
| End-to-End Chinese Speaker Identification | Code | 1 |
| Multi-features based Semantic Augmentation Networks for Named Entity Recognition in Threat Intelligence | Code | 1 |
| A Label-Aware Autoregressive Framework for Cross-Domain NER | Code | 1 |
| MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguation) | Code | 1 |
| NERDA-Con: Extending NER models for Continual Learning -- Integrating Distinct Tasks and Updating Distribution Shifts | Code | 1 |
| Endowing Language Models with Multimodal Knowledge Graph Representations | Code | 1 |
| TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models | Code | 1 |
| SsciBERT: A Pre-trained Language Model for Social Science Texts | Code | 1 |
| Enhanced Entity Annotations for Multilingual Corpora | Code | 1 |
| A Twitter Corpus for Named Entity Recognition in Turkish | Code | 1 |
| hmBERT: Historical Multilingual Language Models for Named Entity Recognition | Code | 1 |
| FinBERT-MRC: financial named entity recognition using BERT under the machine reading comprehension paradigm | Code | 1 |
| RuNNE-2022 Shared Task: Recognizing Nested Named Entities | Code | 1 |
| Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese | Code | 1 |
| DeepStruct: Pretraining of Language Models for Structure Prediction | Code | 1 |
| Wojood: Nested Arabic Named Entity Corpus and Recognition using BERT | Code | 1 |
| Hero-Gang Neural Model For Named Entity Recognition | Code | 1 |
| ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation | Code | 1 |
| NFLAT: Non-Flat-Lattice Transformer for Chinese Named Entity Recognition | Code | 1 |
| Ontology-Driven and Weakly Supervised Rare Disease Identification from Clinical Notes | Code | 1 |
| Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction | Code | 1 |
| Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages | Code | 1 |
| GNNer: Reducing Overlapping in Span-based NER Using Graph Neural Networks | Code | 1 |
| Thai Nested Named Entity Recognition Corpus | Code | 1 |
| Polyglot Prompt: Multilingual Multitask PrompTraining | Code | 1 |
| HiNER: A Large Hindi Named Entity Recognition Dataset | Code | 1 |

Benchmark Results

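| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE + document-context | F1 | 94.6 | | Unverified |
| 2 | LUKE 483M | F1 | 94.3 | | Unverified |
| 3 | Co-regularized LUKE | F1 | 94.22 | | Unverified |
| 4 | LUKE + SubRegWeigh (K-means) | F1 | 94.2 | | Unverified |
| 5 | ASP+T5-3B | F1 | 94.1 | | Unverified |
| 6 | FLERT XLM-R | F1 | 94.09 | | Unverified |
| 7 | PL-Marker | F1 | 94 | | Unverified |
| 8 | CL-KL | F1 | 93.85 | | Unverified |
| 9 | XLNet-GCN | F1 | 93.82 | | Unverified |
| 10 | RoBERTa + SubRegWeigh (K-means) | F1 | 93.81 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BERT-MRC+DSC | F1 | 92.07 | | Unverified |
| 2 | PL-Marker | F1 | 91.9 | | Unverified |
| 3 | Baseline + BS | F1 | 91.74 | | Unverified |
| 4 | Biaffine-NER | F1 | 91.3 | | Unverified |
| 5 | BERT-MRC | F1 | 91.11 | | Unverified |
| 6 | PIQN | F1 | 90.96 | | Unverified |
| 7 | HGN | F1 | 90.92 | | Unverified |
| 8 | Syn-LSTM + BERT (wo doc-context) | F1 | 90.85 | | Unverified |
| 9 | DiffusionNER | F1 | 90.66 | | Unverified |
| 10 | W2NER | F1 | 90.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioBERT | F1 | 89.71 | | Unverified |
| 2 | SpanModel + SequenceLabelingModel | F1 | 89.6 | | Unverified |
| 3 | SciFive-Base | F1 | 89.39 | | Unverified |
| 4 | Spark NLP | F1 | 89.13 | | Unverified |
| 5 | BLSTM-CNN-Char (SparkNLP) | F1 | 89.13 | | Unverified |
| 6 | KeBioLM | F1 | 89.1 | | Unverified |
| 7 | CL-KL | F1 | 88.96 | | Unverified |
| 8 | BioKMNER + BioBERT | F1 | 88.77 | | Unverified |
| 9 | BioLinkBERT (large) | F1 | 88.76 | | Unverified |
| 10 | CompactBioBERT | F1 | 88.67 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CL-KL | F1 | 60.45 | | Unverified |
| 2 | RoBERTa + SubRegWeigh (K-means) | F1 | 60.29 | | Unverified |
| 3 | BERT-CRF (Replicated in AdaSeq) | F1 | 59.69 | | Unverified |
| 4 | RoBERTa-BiLSTM-context | F1 | 59.61 | | Unverified |
| 5 | BERT + RegLER | F1 | 58.9 | | Unverified |
| 6 | TNER -xlm-r-large | F1 | 58.5 | | Unverified |
| 7 | HGN | F1 | 57.41 | | Unverified |
| 8 | ASA + RoBERTa | F1 | 57.3 | | Unverified |
| 9 | BERTweet | F1 | 56.5 | | Unverified |
| 10 | MINER | F1 | 54.86 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Ours: cross-sentence ALB | F1 | 90.9 | | Unverified |
| 2 | GoLLIE | F1 | 89.6 | | Unverified |
| 3 | PromptNER [RoBERTa-large] | F1 | 88.26 | | Unverified |
| 4 | PIQN | F1 | 87.42 | | Unverified |
| 5 | PromptNER [BERT-large] | F1 | 87.21 | | Unverified |
| 6 | DiffusionNER | F1 | 86.93 | | Unverified |
| 7 | BERT-MRC | F1 | 86.88 | | Unverified |
| 8 | UniNER-7B | F1 | 86.69 | | Unverified |
| 9 | Locate and Label | F1 | 86.67 | | Unverified |
| 10 | BoningKnife | F1 | 85.46 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | KeBioLM | F1 | 82 | | Unverified |
| 2 | BLSTM-CNN-Char (SparkNLP) | F1 | 81.29 | | Unverified |
| 3 | Spark NLP | F1 | 81.29 | | Unverified |
| 4 | BINDER | F1 | 80.3 | | Unverified |
| 5 | BioMobileBERT | F1 | 80.13 | | Unverified |
| 6 | BioLinkBERT (large) | F1 | 80.06 | | Unverified |
| 7 | DistilBioBERT | F1 | 79.97 | | Unverified |
| 8 | CompactBioBERT | F1 | 79.88 | | Unverified |
| 9 | BioDistilBERT | F1 | 79.1 | | Unverified |
| 10 | PubMedBERT uncased | F1 | 79.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BINDER | F1 | 91.9 | | Unverified |
| 2 | ConNER | F1 | 91.3 | | Unverified |
| 3 | CL-L2 | F1 | 90.99 | | Unverified |
| 4 | aimped | F1 | 90.95 | | Unverified |
| 5 | BertForTokenClassification (Spark NLP) | F1 | 90.89 | | Unverified |
| 6 | BioLinkBERT (large) | F1 | 90.22 | | Unverified |
| 7 | ELECTRAMed | F1 | 90.03 | | Unverified |
| 8 | BLSTM-CNN-Char (SparkNLP) | F1 | 89.73 | | Unverified |
| 9 | Spark NLP | F1 | 89.73 | | Unverified |
| 10 | UniNER-7B | F1 | 89.34 | | Unverified |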