SOTAVerified

Key Information Extraction

Key Information Extraction (KIE) is aimed at extracting structured information (e.g. key-value pairs) from form-style documents (e.g. invoices), which makes an important step towards intelligent document understanding.

Papers

Showing 5174 of 74 papers

TitleStatusHype
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding0
DUBLIN -- Document Understanding By Language-Image Network0
Information Redundancy and Biases in Public Document Information Extraction BenchmarksCode0
SIMARA: a database for key-value information extraction from full pages0
Information Extraction from Documents: Question Answering vs Token Classification in real-world setups0
GeoLayoutLM: Geometric Pre-training for Visual Information ExtractionCode0
PDFVQA: A New Dataset for Real-World VQA on PDF Documents0
DoSA : A System to Accelerate Annotations on Business Documents with Human-in-the-LoopCode0
NCU1415 at ROCLING 2022 Shared Task: A light-weight transformer-based approach for Biomedical Name Entity Recognition0
PP-StructureV2: A Stronger Document Analysis SystemCode0
Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections0
RDU: A Region-based Approach to Form-style Document Understanding0
Relational Representation Learning in Visually-Rich Documents0
LayoutLMv3: Pre-training for Document AI with Unified Text and Image MaskingCode0
Comparison of biomedical relationship extraction methods and models for knowledge graph creation0
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents0
One-shot Key Information Extraction from Document with Deep Partial Graph Matching0
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and UnderstandingCode0
Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and DissertationsCode0
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents0
Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts0
Spatial Dual-Modality Graph Reasoning for Key Information ExtractionCode0
ICDAR2019 Competition on Scanned Receipt OCR and Information ExtractionCode0
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document UnderstandingCode0
Show:102550
← PrevPage 3 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RORE (GeoLayoutLM)F198.52Unverified
2GeoLayoutLMF197.97Unverified
3LayoutLMv3 LargeF197.46Unverified
4LayoutMask (large)F197.19Unverified
5LayoutMask (base)F196.99Unverified
6TPP (LayoutMask)F196.92Unverified
7LILTF196.07Unverified
8LayoutLMv2LARGEF196.01Unverified
9LayoutLMv2BASEF194.95Unverified
#ModelMetricClaimedVerifiedStatus
1LayoutLMv2LARGE (Excluding OCR mismatch)F197.81Unverified
2RORE (GeoLayoutLM)F196.97Unverified
3LayoutLMv2LARGEF196.61Unverified
4LayoutLMv2BASEF196.25Unverified
5ChatGPT 3.5 SpatialFormatAccuracy77Unverified
#ModelMetricClaimedVerifiedStatus
1LayoutLMv2LARGEF185.2Unverified
2LayoutLMv2BASEF183.3Unverified
3LAMBERT (75M)F180.42Unverified
#ModelMetricClaimedVerifiedStatus
1DANF1 (%)95.05Unverified