Key Information Extraction

Key Information Extraction (KIE) is aimed at extracting structured information (e.g. key-value pairs) from form-style documents (e.g. invoices), which makes an important step towards intelligent document understanding.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 74 papers

Title	Date	Tasks	Status	Hype	Score
TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document	Mar 7, 2024	document understandingKey Information Extraction	CodeCode Available	5	5
LayoutLM: Pre-training of Text and Layout for Document Image Understanding	Dec 31, 2019	Document AIdocument-image-classification	CodeCode Available	2	5
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding	Jul 2, 2024	document understandingKey Information Extraction	CodeCode Available	2	5
OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models	May 13, 2023	Key Information ExtractionNutrition	CodeCode Available	2	5
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding	Feb 28, 2022	Document Image Classificationdocument understanding	CodeCode Available	2	5
GenKIE: Robust Generative Multimodal Document Key Information Extraction	Oct 24, 2023	DecoderKey Information Extraction	CodeCode Available	1	5
Form-NLU: Dataset for the Form Natural Language Understanding	Apr 4, 2023	4kForm	CodeCode Available	1	5
Key Information Extraction From Documents: Evaluation And Generator	Jun 9, 2021	DecoderKey Information Extraction	CodeCode Available	1	5
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents	Aug 10, 2021	Key Information ExtractionLanguage Modeling	CodeCode Available	1	5
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks	Aug 23, 2022	Document Layout Analysisdocument understanding	CodeCode Available	1	5

Show:10 25 50

← PrevPage 1 of 8Next →

All datasets CORD SROIE Kleister NDA SIMARA

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	RORE (GeoLayoutLM)	F1	98.52	—	Unverified
2	GeoLayoutLM	F1	97.97	—	Unverified
3	LayoutLMv3 Large	F1	97.46	—	Unverified
4	LayoutMask (large)	F1	97.19	—	Unverified
5	LayoutMask (base)	F1	96.99	—	Unverified
6	TPP (LayoutMask)	F1	96.92	—	Unverified
7	LILT	F1	96.07	—	Unverified
8	LayoutLMv2LARGE	F1	96.01	—	Unverified
9	LayoutLMv2BASE	F1	94.95	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LayoutLMv2LARGE (Excluding OCR mismatch)	F1	97.81	—	Unverified
2	RORE (GeoLayoutLM)	F1	96.97	—	Unverified
3	LayoutLMv2LARGE	F1	96.61	—	Unverified
4	LayoutLMv2BASE	F1	96.25	—	Unverified
5	ChatGPT 3.5 SpatialFormat	Accuracy	77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LayoutLMv2LARGE	F1	85.2	—	Unverified
2	LayoutLMv2BASE	F1	83.3	—	Unverified
3	LAMBERT (75M)	F1	80.42	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DAN	F1 (%)	95.05	—	Unverified