SOTAVerified

Key Information Extraction

Key Information Extraction (KIE) is aimed at extracting structured information (e.g. key-value pairs) from form-style documents (e.g. invoices), which makes an important step towards intelligent document understanding.

Papers

Showing 5174 of 74 papers

TitleStatusHype
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding0
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding0
NCU1415 at ROCLING 2022 Shared Task: A light-weight transformer-based approach for Biomedical Name Entity Recognition0
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition0
OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition0
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models0
One-shot Key Information Extraction from Document with Deep Partial Graph Matching0
PaddleOCR 3.0 Technical Report0
PDFVQA: A New Dataset for Real-World VQA on PDF Documents0
PP-StructureV2: A Stronger Document Analysis System0
PrIeD-KIE: Towards Privacy Preserved Document Key Information Extraction0
RDU: A Region-based Approach to Form-style Document Understanding0
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document UnderstandingCode0
LayoutLMv3: Pre-training for Document AI with Unified Text and Image MaskingCode0
GraphRevisedIE: Multimodal Information Extraction with Graph-Revised NetworkCode0
Information Redundancy and Biases in Public Document Information Extraction BenchmarksCode0
XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form ParserCode0
Information Extraction from Visually Rich Documents Using Directed Weighted Graph Neural NetworkCode0
Multimodal weighted graph representation for information extraction from visually rich documents.Code0
AMuRD: Annotated Arabic-English Receipt Dataset for Key Information Extraction and ClassificationCode0
DoSA : A System to Accelerate Annotations on Business Documents with Human-in-the-LoopCode0
Different Tastes of Entities: Investigating Human Label Variation in Named Entity AnnotationsCode0
Class-Agnostic Region-of-Interest Matching in Document ImagesCode0
Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and DissertationsCode0
Show:102550
← PrevPage 2 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RORE (GeoLayoutLM)F198.52Unverified
2GeoLayoutLMF197.97Unverified
3LayoutLMv3 LargeF197.46Unverified
4LayoutMask (large)F197.19Unverified
5LayoutMask (base)F196.99Unverified
6TPP (LayoutMask)F196.92Unverified
7LILTF196.07Unverified
8LayoutLMv2LARGEF196.01Unverified
9LayoutLMv2BASEF194.95Unverified
#ModelMetricClaimedVerifiedStatus
1LayoutLMv2LARGE (Excluding OCR mismatch)F197.81Unverified
2RORE (GeoLayoutLM)F196.97Unverified
3LayoutLMv2LARGEF196.61Unverified
4LayoutLMv2BASEF196.25Unverified
5ChatGPT 3.5 SpatialFormatAccuracy77Unverified
#ModelMetricClaimedVerifiedStatus
1LayoutLMv2LARGEF185.2Unverified
2LayoutLMv2BASEF183.3Unverified
3LAMBERT (75M)F180.42Unverified
#ModelMetricClaimedVerifiedStatus
1DANF1 (%)95.05Unverified