SOTAVerified

Named Entity Recognition (NER)

Named Entity Recognition (NER) is a Natural Language Processing (NLP) task that identifies named entities in text and classifies them into predefined categories such as person names, organizations, and locations. The goal is to extract structured information from unstructured text and represent it in a machine-readable format. Approaches typically use BIO notation, in which B marks the beginning of an entity, I marks a token inside an entity, and O marks non-entity tokens.

Example:

| Mark | Watney | visited | Mars |
| --- | --- | --- | --- |
| B-PER | I-PER | O | B-LOC |
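
To make the BIO scheme concrete, here is a minimal sketch of how a tagged sequence might be grouped back into entities. The function name `bio_to_spans` and its lenient handling of a stray `I-` tag (treated as the start of a new entity) are illustrative assumptions, not part of any particular library.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (entity_text, entity_type) pairs."""
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")

    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Close the entity that is currently open, if there is one.
        nonlocal current_tokens, current_type
        if current_tokens and current_type is not None:
            spans.append((" ".join(current_tokens), current_type))
        current_tokens, current_type = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # B- always starts a new entity.
            flush()
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            # I- of the same type continues the open entity.
            current_tokens.append(token)
        else:
            # "O", or an I- tag that does not continue the open entity.
            flush()
            if tag.startswith("I-"):
                # Assumption: a stray I- tag leniently opens a new entity.
                current_tokens, current_type = [token], tag[2:]
    flush()
    return spans


tokens = ["Mark", "Watney", "visited", "Mars"]
tags = ["B-PER", "I-PER", "O", "B-LOC"]
print(bio_to_spans(tokens, tags))
# prints [('Mark Watney', 'PER'), ('Mars', 'LOC')]
```

Joining tokens with a single space is a simplification; a real pipeline would usually keep character offsets so the original text can be recovered exactly.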


Papers

Showing 51-100 of 2874 papers

| Title | Status | Hype |
| --- | --- | --- |
| A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction | Code | 1 |
| PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition | Code | 1 |
| The Radiation Oncology NLP Database | Code | 1 |
| Filtered Semi-Markov CRF | Code | 1 |
| GSAP-NER: A Novel Task, Corpus, and Baseline for Scholarly Entity Extraction Focused on Machine Learning Models and Datasets | Code | 1 |
| Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models | Code | 1 |
| Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark | Code | 1 |
| calamanCy: A Tagalog Natural Language Processing Toolkit | Code | 1 |
| Developing a Named Entity Recognition Dataset for Tagalog | Code | 1 |
| CleanCoNLL: A Nearly Noise-Free Named Entity Recognition Dataset | Code | 1 |
| NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval | Code | 1 |
| HEProto: A Hierarchical Enhancing ProtoNet based on Multi-Task Learning for Few-shot Named Entity Recognition | Code | 1 |
| Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets | Code | 1 |
| Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction | Code | 1 |
| Empirical Study of Zero-Shot NER with ChatGPT | Code | 1 |
| Label Supervised LLaMA Finetuning | Code | 1 |
| DiscoverPath: A Knowledge Refinement and Retrieval System for Interdisciplinarity on Biomedical Research | Code | 1 |
| Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines | Code | 1 |
| ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER | Code | 1 |
| Supplementary Features of BiLSTM for Enhanced Sequence Labeling | Code | 1 |
| E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition | Code | 1 |
| PromptNER: Prompt Locating and Typing for Named Entity Recognition | Code | 1 |
| DiffusionNER: Boundary Diffusion for Named Entity Recognition | Code | 1 |
| PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search | Code | 1 |
| From Zero to Hero: Harnessing Transformers for Biomedical Named Entity Recognition in Zero- and Few-shot Contexts | Code | 1 |
| ViMQ: A Vietnamese Medical Question Dataset for Healthcare Dialogue System Development | Code | 1 |
| FindVehicle and VehicleFinder: A NER dataset for natural language-based vehicle retrieval and a keyword-based cross-modal vehicle retrieval system | Code | 1 |
| EasyNER: A Customizable Easy-to-Use Pipeline for Deep Learning- and Dictionary-based Named Entity Recognition from Medical Text | Code | 1 |
| Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering | Code | 1 |
| DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4 | Code | 1 |
| A Human Subject Study of Named Entity Recognition (NER) in Conversational Music Recommendation Queries | Code | 1 |
| A Neural Span-Based Continual Named Entity Recognition Model | Code | 1 |
| FiNER-ORD: Financial Named Entity Recognition Open Research Dataset | Code | 1 |
| Meta-Learning Triplet Network with Adaptive Margins for Few-Shot Named Entity Recognition | Code | 1 |
| Type-Aware Decomposed Framework for Few-Shot Named Entity Recognition | Code | 1 |
| Lightweight Transformers for Clinical Natural Language Processing | Code | 1 |
| Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing | Code | 1 |
| Bioformer: an efficient transformer language model for biomedical text mining | Code | 1 |
| A Comparative Study of Pretrained Language Models for Long Clinical Text | Code | 1 |
| ViDeBERTa: A powerful pre-trained language model for Vietnamese | Code | 1 |
| Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages | Code | 1 |
| AIONER: All-in-one scheme-based biomedical named entity recognition using deep learning | Code | 1 |
| Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Tagging | Code | 1 |
| PUnifiedNER: A Prompting-based Unified NER System for Diverse Datasets | Code | 1 |
| Hengam: An Adversarially Trained Transformer for Persian Temporal Tagging | Code | 1 |
| GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation | Code | 1 |
| ConNER: Consistency Training for Cross-lingual Named Entity Recognition | Code | 1 |
| Prompt-Based Metric Learning for Few-Shot NER | Code | 1 |
| ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications | Code | 1 |
| Named Entity Recognition in Indian court judgments | Code | 1 |

Benchmark Results

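| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE + document-context | F1 | 94.6 | | Unverified |
| 2 | LUKE 483M | F1 | 94.3 | | Unverified |
| 3 | Co-regularized LUKE | F1 | 94.22 | | Unverified |
| 4 | LUKE + SubRegWeigh (K-means) | F1 | 94.2 | | Unverified |
| 5 | ASP+T5-3B | F1 | 94.1 | | Unverified |
| 6 | FLERT XLM-R | F1 | 94.09 | | Unverified |
| 7 | PL-Marker | F1 | 94 | | Unverified |
| 8 | CL-KL | F1 | 93.85 | | Unverified |
| 9 | XLNet-GCN | F1 | 93.82 | | Unverified |
| 10 | RoBERTa + SubRegWeigh (K-means) | F1 | 93.81 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BERT-MRC+DSC | F1 | 92.07 | | Unverified |
| 2 | PL-Marker | F1 | 91.9 | | Unverified |
| 3 | Baseline + BS | F1 | 91.74 | | Unverified |
| 4 | Biaffine-NER | F1 | 91.3 | | Unverified |
| 5 | BERT-MRC | F1 | 91.11 | | Unverified |
| 6 | PIQN | F1 | 90.96 | | Unverified |
| 7 | HGN | F1 | 90.92 | | Unverified |
| 8 | Syn-LSTM + BERT (wo doc-context) | F1 | 90.85 | | Unverified |
| 9 | DiffusionNER | F1 | 90.66 | | Unverified |
| 10 | W2NER | F1 | 90.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioBERT | F1 | 89.71 | | Unverified |
| 2 | SpanModel + SequenceLabelingModel | F1 | 89.6 | | Unverified |
| 3 | SciFive-Base | F1 | 89.39 | | Unverified |
| 4 | BLSTM-CNN-Char (SparkNLP) | F1 | 89.13 | | Unverified |
| 5 | Spark NLP | F1 | 89.13 | | Unverified |
| 6 | KeBioLM | F1 | 89.1 | | Unverified |
| 7 | CL-KL | F1 | 88.96 | | Unverified |
| 8 | BioKMNER + BioBERT | F1 | 88.77 | | Unverified |
| 9 | BioLinkBERT (large) | F1 | 88.76 | | Unverified |
| 10 | CompactBioBERT | F1 | 88.67 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CL-KL | F1 | 60.45 | | Unverified |
| 2 | RoBERTa + SubRegWeigh (K-means) | F1 | 60.29 | | Unverified |
| 3 | BERT-CRF (Replicated in AdaSeq) | F1 | 59.69 | | Unverified |
| 4 | RoBERTa-BiLSTM-context | F1 | 59.61 | | Unverified |
| 5 | BERT + RegLER | F1 | 58.9 | | Unverified |
| 6 | TNER -xlm-r-large | F1 | 58.5 | | Unverified |
| 7 | HGN | F1 | 57.41 | | Unverified |
| 8 | ASA + RoBERTa | F1 | 57.3 | | Unverified |
| 9 | BERTweet | F1 | 56.5 | | Unverified |
| 10 | MINER | F1 | 54.86 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Ours: cross-sentence ALB | F1 | 90.9 | | Unverified |
| 2 | GoLLIE | F1 | 89.6 | | Unverified |
| 3 | PromptNER [RoBERTa-large] | F1 | 88.26 | | Unverified |
| 4 | PIQN | F1 | 87.42 | | Unverified |
| 5 | PromptNER [BERT-large] | F1 | 87.21 | | Unverified |
| 6 | DiffusionNER | F1 | 86.93 | | Unverified |
| 7 | BERT-MRC | F1 | 86.88 | | Unverified |
| 8 | UniNER-7B | F1 | 86.69 | | Unverified |
| 9 | Locate and Label | F1 | 86.67 | | Unverified |
| 10 | BoningKnife | F1 | 85.46 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | KeBioLM | F1 | 82 | | Unverified |
| 2 | BLSTM-CNN-Char (SparkNLP) | F1 | 81.29 | | Unverified |
| 3 | Spark NLP | F1 | 81.29 | | Unverified |
| 4 | BINDER | F1 | 80.3 | | Unverified |
| 5 | BioMobileBERT | F1 | 80.13 | | Unverified |
| 6 | BioLinkBERT (large) | F1 | 80.06 | | Unverified |
| 7 | DistilBioBERT | F1 | 79.97 | | Unverified |
| 8 | CompactBioBERT | F1 | 79.88 | | Unverified |
| 9 | BioDistilBERT | F1 | 79.1 | | Unverified |
| 10 | PubMedBERT uncased | F1 | 79.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BINDER | F1 | 91.9 | | Unverified |
| 2 | ConNER | F1 | 91.3 | | Unverified |
| 3 | CL-L2 | F1 | 90.99 | | Unverified |
| 4 | aimped | F1 | 90.95 | | Unverified |
| 5 | BertForTokenClassification (Spark NLP) | F1 | 90.89 | | Unverified |
| 6 | BioLinkBERT (large) | F1 | 90.22 | | Unverified |
| 7 | ELECTRAMed | F1 | 90.03 | | Unverified |
| 8 | Spark NLP | F1 | 89.73 | | Unverified |
| 9 | BLSTM-CNN-Char (SparkNLP) | F1 | 89.73 | | Unverified |
| 10 | UniNER-7B | F1 | 89.34 | | Unverified |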