SOTAVerified

Key Information Extraction

Key Information Extraction (KIE) is aimed at extracting structured information (e.g. key-value pairs) from form-style documents (e.g. invoices), which makes an important step towards intelligent document understanding.

Papers

Showing 5174 of 74 papers

TitleStatusHype
NCU1415 at ROCLING 2022 Shared Task: A light-weight transformer-based approach for Biomedical Name Entity Recognition0
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document UnderstandingCode1
PP-StructureV2: A Stronger Document Analysis SystemCode0
Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections0
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural NetworksCode1
RDU: A Region-based Approach to Form-style Document Understanding0
Relational Representation Learning in Visually-Rich Documents0
LayoutLMv3: Pre-training for Document AI with Unified Text and Image MaskingCode0
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document UnderstandingCode2
Comparison of biomedical relationship extraction methods and models for knowledge graph creation0
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents0
One-shot Key Information Extraction from Document with Deep Partial Graph Matching0
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and UnderstandingCode0
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from DocumentsCode1
Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and DissertationsCode0
Key Information Extraction From Documents: Evaluation And GeneratorCode1
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents0
Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts0
Spatial Dual-Modality Graph Reasoning for Key Information ExtractionCode0
ICDAR2019 Competition on Scanned Receipt OCR and Information ExtractionCode0
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document UnderstandingCode0
PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional NetworksCode1
LAMBERT: Layout-Aware (Language) Modeling for information extractionCode1
LayoutLM: Pre-training of Text and Layout for Document Image UnderstandingCode2
Show:102550
← PrevPage 3 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RORE (GeoLayoutLM)F198.52Unverified
2GeoLayoutLMF197.97Unverified
3LayoutLMv3 LargeF197.46Unverified
4LayoutMask (large)F197.19Unverified
5LayoutMask (base)F196.99Unverified
6TPP (LayoutMask)F196.92Unverified
7LILTF196.07Unverified
8LayoutLMv2LARGEF196.01Unverified
9LayoutLMv2BASEF194.95Unverified
#ModelMetricClaimedVerifiedStatus
1LayoutLMv2LARGE (Excluding OCR mismatch)F197.81Unverified
2RORE (GeoLayoutLM)F196.97Unverified
3LayoutLMv2LARGEF196.61Unverified
4LayoutLMv2BASEF196.25Unverified
5ChatGPT 3.5 SpatialFormatAccuracy77Unverified
#ModelMetricClaimedVerifiedStatus
1LayoutLMv2LARGEF185.2Unverified
2LayoutLMv2BASEF183.3Unverified
3LAMBERT (75M)F180.42Unverified
#ModelMetricClaimedVerifiedStatus
1DANF1 (%)95.05Unverified