SOTAVerified

Named Entity Recognition (NER)

Named Entity Recognition (NER) is a task of Natural Language Processing (NLP) that involves identifying and classifying named entities in a text into predefined categories such as person names, organizations, locations, and others. The goal of NER is to extract structured information from unstructured text data and represent it in a machine-readable format. Approaches typically use BIO notation, which differentiates the beginning (B) and the inside (I) of entities. O is used for non-entity tokens.

Example:

| Mark | Watney | visited | Mars | | --- | ---| --- | --- | | B-PER | I-PER | O | B-LOC |

( Image credit: Zalando )

Papers

Showing 301350 of 2874 papers

TitleStatusHype
AraELECTRA: Pre-Training Text Discriminators for Arabic Language UnderstandingCode1
Boundary Smoothing for Named Entity RecognitionCode1
A Comparative Study of Pre-trained Encoders for Low-Resource Named Entity RecognitionCode1
Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising LearningCode1
calamanCy: A Tagalog Natural Language Processing ToolkitCode1
CAMeL Tools: An Open Source Python Toolkit for Arabic Natural Language ProcessingCode1
A Label-Aware Autoregressive Framework for Cross-Domain NERCode1
Improving Neural Named Entity Recognition with GazetteersCode1
A Large-Scale Chinese Multimodal NER Dataset with Speech CluesCode1
Causal Distillation for Language ModelsCode1
A Comparative Study of Pretrained Language Models for Long Clinical TextCode1
Actionable Entities Recognition Benchmark for Interactive FictionCode1
IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian LanguagesCode1
Information Extraction of Clinical Trial Eligibility CriteriaCode1
A Robust and Domain-Adaptive Approach for Low-Resource Named Entity RecognitionCode1
AlephBERT:A Hebrew Large Pre-Trained Language Model to Start-off your Hebrew NLP Application WithCode1
Interpretable Multi-dataset Evaluation for Named Entity RecognitionCode1
CHisIEC: An Information Extraction Corpus for Ancient Chinese HistoryCode1
Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequencesCode1
Coach: A Coarse-to-Fine Approach for Cross-domain Slot FillingCode1
A Twitter Corpus for Named Entity Recognition in TurkishCode1
ConNER: Consistency Training for Cross-lingual Named Entity RecognitionCode1
Coarse-to-Fine Pre-training for Named Entity RecognitionCode1
KALA: Knowledge-Augmented Language Model AdaptationCode1
Code and Named Entity Recognition in StackOverflowCode1
A Sequence-to-Set Network for Nested Named Entity RecognitionCode1
A Unified Generative Framework for Various NER SubtasksCode1
Computer Science Named Entity Recognition in the Open Research Knowledge GraphCode1
Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media AnalysisCode1
A Simple but Effective Approach to Improve Structured Language Model Output for Information ExtractionCode1
Computationally Efficient NER Taggers with Combined Embeddings and Constrained DecodingCode1
KoCHET: a Korean Cultural Heritage corpus for Entity-related TasksCode1
Label-Descriptive Patterns and Their Application to Characterizing Classification ErrorsCode1
Label-Guided In-Context Learning for Named Entity RecognitionCode1
SeqScore: Addressing Barriers to Reproducible Named Entity Recognition EvaluationCode1
COPNER: Contrastive Learning with Prompt Guiding for Few-shot Named Entity RecognitionCode1
CONTaiNER: Few-Shot Named Entity Recognition via Contrastive LearningCode1
A Span-Based Model for Joint Overlapped and Discontinuous Named Entity RecognitionCode1
A Unified MRC Framework for Named Entity RecognitionCode1
Entity, Relation, and Event Extraction with Contextualized Span RepresentationsCode1
Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on GeneralizationCode1
Locate and Label: A Two-stage Identifier for Nested Named Entity RecognitionCode1
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control CommunicationsCode1
Automated Concatenation of Embeddings for Structured PredictionCode1
COVID-19 Named Entity Recognition for VietnameseCode1
MarkBERT: Marking Word Boundaries Improves Chinese BERTCode1
MASK: A flexible framework to facilitate de-identification of clinical textsCode1
MatSciBERT: A Materials Domain Language Model for Text Mining and Information ExtractionCode1
CrossNER: Evaluating Cross-Domain Named Entity RecognitionCode1
DeepStruct: Pretraining of Language Models for Structure PredictionCode1
Show:102550
← PrevPage 7 of 58Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACE + document-contextF194.6Unverified
2LUKE 483MF194.3Unverified
3Co-regularized LUKEF194.22Unverified
4LUKE + SubRegWeigh (K-means)F194.2Unverified
5ASP+T5-3BF194.1Unverified
6FLERT XLM-RF194.09Unverified
7PL-MarkerF194Unverified
8CL-KLF193.85Unverified
9XLNet-GCNF193.82Unverified
10RoBERTa + SubRegWeigh (K-means)F193.81Unverified
#ModelMetricClaimedVerifiedStatus
1BERT-MRC+DSCF192.07Unverified
2PL-MarkerF191.9Unverified
3Baseline + BSF191.74Unverified
4Biaffine-NERF191.3Unverified
5BERT-MRCF191.11Unverified
6PIQNF190.96Unverified
7HGNF190.92Unverified
8Syn-LSTM + BERT (wo doc-context)F190.85Unverified
9DiffusionNERF190.66Unverified
10W2NERF190.5Unverified
#ModelMetricClaimedVerifiedStatus
1BioBERTF189.71Unverified
2SpanModel + SequenceLabelingModelF189.6Unverified
3SciFive-BaseF189.39Unverified
4Spark NLPF189.13Unverified
5BLSTM-CNN-Char (SparkNLP)F189.13Unverified
6KeBioLMF189.1Unverified
7CL-KLF188.96Unverified
8BioKMNER + BioBERTF188.77Unverified
9BioLinkBERT (large)F188.76Unverified
10CompactBioBERTF188.67Unverified
#ModelMetricClaimedVerifiedStatus
1CL-KLF160.45Unverified
2RoBERTa + SubRegWeigh (K-means)F160.29Unverified
3BERT-CRF (Replicated in AdaSeq)F159.69Unverified
4RoBERTa-BiLSTM-contextF159.61Unverified
5BERT + RegLERF158.9Unverified
6TNER -xlm-r-largeF158.5Unverified
7HGNF157.41Unverified
8ASA + RoBERTaF157.3Unverified
9BERTweetF156.5Unverified
10MINERF154.86Unverified
#ModelMetricClaimedVerifiedStatus
1Ours: cross-sentence ALBF190.9Unverified
2GoLLIEF189.6Unverified
3PromptNER [RoBERTa-large]F188.26Unverified
4PIQNF187.42Unverified
5PromptNER [BERT-large]F187.21Unverified
6DiffusionNERF186.93Unverified
7BERT-MRCF186.88Unverified
8UniNER-7BF186.69Unverified
9Locate and LabelF186.67Unverified
10BoningKnifeF185.46Unverified
#ModelMetricClaimedVerifiedStatus
1KeBioLMF182Unverified
2BLSTM-CNN-Char (SparkNLP)F181.29Unverified
3Spark NLPF181.29Unverified
4BINDERF180.3Unverified
5BioMobileBERTF180.13Unverified
6BioLinkBERT (large)F180.06Unverified
7DistilBioBERTF179.97Unverified
8CompactBioBERTF179.88Unverified
9BioDistilBERTF179.1Unverified
10PubMedBERT uncasedF179.1Unverified
#ModelMetricClaimedVerifiedStatus
1BINDERF191.9Unverified
2ConNERF191.3Unverified
3CL-L2F190.99Unverified
4aimpedF190.95Unverified
5BertForTokenClassification (Spark NLP)F190.89Unverified
6BioLinkBERT (large)F190.22Unverified
7ELECTRAMedF190.03Unverified
8BLSTM-CNN-Char (SparkNLP)F189.73Unverified
9Spark NLPF189.73Unverified
10UniNER-7BF189.34Unverified