Named Entity Recognition (NER)

Named Entity Recognition (NER) is a task of Natural Language Processing (NLP) that involves identifying and classifying named entities in a text into predefined categories such as person names, organizations, locations, and others. The goal of NER is to extract structured information from unstructured text data and represent it in a machine-readable format. Approaches typically use BIO notation, which differentiates the beginning (B) and the inside (I) of entities. O is used for non-entity tokens.

Example:

| Mark | Watney | visited | Mars | | --- | ---| --- | --- | | B-PER | I-PER | O | B-LOC |

( Image credit: Zalando )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 2874 papers

Title	Date	Tasks	Status	Hype
Do "English" Named Entity Recognizers Work Well on Global Englishes?	Apr 20, 2024	named-entity-recognitionNamed Entity Recognition	CodeCode Available	5
Biomedical and Clinical English Model Packages in the Stanza Python NLP Library	Jul 29, 2020	GPUNamed Entity Recognition	CodeCode Available	3
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding	Oct 11, 2018	Citation Intent ClassificationCommon Sense Reasoning	CodeCode Available	3
TENER: Adapting Transformer Encoder for Named Entity Recognition	Nov 10, 2019	Chinese Named Entity RecognitionNamed Entity Recognition	CodeCode Available	3
ERNIE 2.0: A Continual Pre-training Framework for Language Understanding	Jul 29, 2019	Chinese Named Entity RecognitionChinese Reading Comprehension	CodeCode Available	3
Accurate clinical and biomedical Named entity recognition at scale	Jul 19, 2022	Clinical Concept ExtractionDe-identification	CodeCode Available	3
Ludwig: a type-based declarative deep learning toolbox	Sep 17, 2019	DecoderDeep Learning	CodeCode Available	3
A Survey of Large Language Models in Finance (FinLLMs)	Feb 4, 2024	Named Entity Recognition (NER)Question Answering	CodeCode Available	3
WhisperNER: Unified Open Named Entity and Speech Recognition	Sep 12, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	3
N-LTP: An Open-source Neural Language Technology Platform for Chinese	Sep 24, 2020	Chinese Word SegmentationDependency Parsing	CodeCode Available	3
Pre-Training with Whole Word Masking for Chinese BERT	Jun 19, 2019	Document ClassificationGeneral Classification	CodeCode Available	3
ERNIE: Enhanced Representation through Knowledge Integration	Apr 19, 2019	Chinese Named Entity RecognitionChinese Sentence Pair Classification	CodeCode Available	3
GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction	Oct 5, 2023	Event Argument ExtractionEvent Extraction	CodeCode Available	2
MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer	Apr 30, 2020	Cross-Lingual Transfernamed-entity-recognition	CodeCode Available	2
Language Modelling with Pixels	Jul 14, 2022	Language ModellingNamed Entity Recognition	CodeCode Available	2
Sensitive Data Detection with High-Throughput Neural Network Models for Financial Institutions	Dec 17, 2020	named-entity-recognitionNamed Entity Recognition	CodeCode Available	2
ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPT	Feb 20, 2023	Event Extractionnamed-entity-recognition	CodeCode Available	2
UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition	Aug 7, 2023	named-entity-recognitionNamed Entity Recognition	CodeCode Available	2
Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning	May 29, 2022	Few-Shot Text ClassificationMemorization	CodeCode Available	2
TechGPT-2.0: A large language model project to solve the task of knowledge graph construction	Jan 9, 2024	graph constructionLanguage Modeling	CodeCode Available	2
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance	Jun 8, 2023	Conversational Question AnsweringLanguage Modeling	CodeCode Available	2
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges	Apr 24, 2024	Drug DesignInductive Bias	CodeCode Available	2
LinkBERT: Pretraining Language Models with Document Links	Mar 29, 2022	Document ClassificationLanguage Modeling	CodeCode Available	2
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts	Oct 7, 2022	ArticlesLanguage Modeling	CodeCode Available	2
T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition	Sep 9, 2022	AllDomain Generalization	CodeCode Available	2
Rethinking Negative Instances for Generative Named Entity Recognition	Feb 26, 2024	named-entity-recognitionNamed Entity Recognition	CodeCode Available	2
FLAT: Chinese NER Using Flat-Lattice Transformer	Apr 24, 2020	Chinese Named Entity Recognitionnamed-entity-recognition	CodeCode Available	2
Decomposed Meta-Learning for Few-Shot Named Entity Recognition	Apr 12, 2022	Entity TypingFew-shot NER	CodeCode Available	2
Foresight -- Generative Pretrained Transformer (GPT) for Modelling of Patient Timelines using EHRs	Dec 13, 2022	named-entity-recognitionNamed Entity Recognition	CodeCode Available	2
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition	May 24, 2023	DenoisingKnowledge Distillation	CodeCode Available	2
BERN2: an advanced neural biomedical named entity recognition and normalization tool	Jan 6, 2022	graph constructionnamed-entity-recognition	CodeCode Available	2
DeBERTa: Decoding-enhanced BERT with Disentangled Attention	Jun 5, 2020	Common Sense ReasoningCoreference Resolution	CodeCode Available	2
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer	Nov 14, 2023	named-entity-recognitionNamed Entity Recognition	CodeCode Available	2
CLUENER2020: Fine-grained Named Entity Recognition Dataset and Benchmark for Chinese	Jan 13, 2020	Chinese Named Entity Recognitionnamed-entity-recognition	CodeCode Available	2
TweetNLP: Cutting-Edge Natural Language Processing for Social Media	Jun 29, 2022	Language IdentificationNamed Entity Recognition	CodeCode Available	2
A Span-Based Model for Joint Overlapped and Discontinuous Named Entity Recognition	Jun 28, 2021	named-entity-recognitionNamed Entity Recognition	CodeCode Available	1
A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction	Feb 20, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets	Apr 19, 2022	Dialogue Act ClassificationDialogue Understanding	CodeCode Available	1
Can images help recognize entities? A study of the role of images for Multimodal NER	Oct 23, 2020	Image Captioningnamed-entity-recognition	CodeCode Available	1
A Sequence-to-Set Network for Nested Named Entity Recognition	May 19, 2021	Decodernamed-entity-recognition	CodeCode Available	1
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications	Nov 8, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
AraBERT: Transformer-based Model for Arabic Language Understanding	Feb 28, 2020	modelnamed-entity-recognition	CodeCode Available	1
Application of Deep Learning in Generating Structured Radiology Reports: A Transformer-Based Technique	Sep 25, 2022	named-entity-recognitionNamed Entity Recognition	CodeCode Available	1
AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding	Dec 31, 2020	Language ModelingLanguage Modelling	CodeCode Available	1
An Incremental Parser for Abstract Meaning Representation	Aug 22, 2016	Abstract Meaning RepresentationAMR Parsing	CodeCode Available	1
An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling	Sep 27, 2021	Few-shot NERMeta-Learning	CodeCode Available	1
An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and Normalization	Aug 1, 2021	Knowledge GraphsMedical Named Entity Recognition	CodeCode Available	1
A Neural Span-Based Continual Named Entity Recognition Model	Feb 23, 2023	Continual LearningContinual Named Entity Recognition	CodeCode Available	1
A Neural Transition-based Model for Nested Mention Recognition	Oct 3, 2018	Named Entity Recognition (NER)Nested Mention Recognition	CodeCode Available	1
Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis	Jan 18, 2022	Dependency Parsingnamed-entity-recognition	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 58Next →

All datasets CoNLL 2003 (English)Ontonotes v5 (English)NCBI Disease WNUT 2017 ACE 2005 JNLPBA BC5CDR GENIA BC2GM BC5CDR-chemical SLUE CoNLL++

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACE + document-context	F1	94.6	—	Unverified
2	LUKE 483M	F1	94.3	—	Unverified
3	Co-regularized LUKE	F1	94.22	—	Unverified
4	LUKE + SubRegWeigh (K-means)	F1	94.2	—	Unverified
5	ASP+T5-3B	F1	94.1	—	Unverified
6	FLERT XLM-R	F1	94.09	—	Unverified
7	PL-Marker	F1	94	—	Unverified
8	CL-KL	F1	93.85	—	Unverified
9	XLNet-GCN	F1	93.82	—	Unverified
10	RoBERTa + SubRegWeigh (K-means)	F1	93.81	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BERT-MRC+DSC	F1	92.07	—	Unverified
2	PL-Marker	F1	91.9	—	Unverified
3	Baseline + BS	F1	91.74	—	Unverified
4	Biaffine-NER	F1	91.3	—	Unverified
5	BERT-MRC	F1	91.11	—	Unverified
6	PIQN	F1	90.96	—	Unverified
7	HGN	F1	90.92	—	Unverified
8	Syn-LSTM + BERT (wo doc-context)	F1	90.85	—	Unverified
9	DiffusionNER	F1	90.66	—	Unverified
10	W2NER	F1	90.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BioBERT	F1	89.71	—	Unverified
2	SpanModel + SequenceLabelingModel	F1	89.6	—	Unverified
3	SciFive-Base	F1	89.39	—	Unverified
4	Spark NLP	F1	89.13	—	Unverified
5	BLSTM-CNN-Char (SparkNLP)	F1	89.13	—	Unverified
6	KeBioLM	F1	89.1	—	Unverified
7	CL-KL	F1	88.96	—	Unverified
8	BioKMNER + BioBERT	F1	88.77	—	Unverified
9	BioLinkBERT (large)	F1	88.76	—	Unverified
10	CompactBioBERT	F1	88.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CL-KL	F1	60.45	—	Unverified
2	RoBERTa + SubRegWeigh (K-means)	F1	60.29	—	Unverified
3	BERT-CRF (Replicated in AdaSeq)	F1	59.69	—	Unverified
4	RoBERTa-BiLSTM-context	F1	59.61	—	Unverified
5	BERT + RegLER	F1	58.9	—	Unverified
6	TNER -xlm-r-large	F1	58.5	—	Unverified
7	HGN	F1	57.41	—	Unverified
8	ASA + RoBERTa	F1	57.3	—	Unverified
9	BERTweet	F1	56.5	—	Unverified
10	MINER	F1	54.86	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Ours: cross-sentence ALB	F1	90.9	—	Unverified
2	GoLLIE	F1	89.6	—	Unverified
3	PromptNER [RoBERTa-large]	F1	88.26	—	Unverified
4	PIQN	F1	87.42	—	Unverified
5	PromptNER [BERT-large]	F1	87.21	—	Unverified
6	DiffusionNER	F1	86.93	—	Unverified
7	BERT-MRC	F1	86.88	—	Unverified
8	UniNER-7B	F1	86.69	—	Unverified
9	Locate and Label	F1	86.67	—	Unverified
10	BoningKnife	F1	85.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	KeBioLM	F1	82	—	Unverified
2	Spark NLP	F1	81.29	—	Unverified
3	BLSTM-CNN-Char (SparkNLP)	F1	81.29	—	Unverified
4	BINDER	F1	80.3	—	Unverified
5	BioMobileBERT	F1	80.13	—	Unverified
6	BioLinkBERT (large)	F1	80.06	—	Unverified
7	DistilBioBERT	F1	79.97	—	Unverified
8	CompactBioBERT	F1	79.88	—	Unverified
9	BioDistilBERT	F1	79.1	—	Unverified
10	PubMedBERT uncased	F1	79.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BINDER	F1	91.9	—	Unverified
2	ConNER	F1	91.3	—	Unverified
3	CL-L2	F1	90.99	—	Unverified
4	aimped	F1	90.95	—	Unverified
5	BertForTokenClassification (Spark NLP)	F1	90.89	—	Unverified
6	BioLinkBERT (large)	F1	90.22	—	Unverified
7	ELECTRAMed	F1	90.03	—	Unverified
8	BLSTM-CNN-Char (SparkNLP)	F1	89.73	—	Unverified
9	Spark NLP	F1	89.73	—	Unverified
10	UniNER-7B	F1	89.34	—	Unverified