SOTAVerified

Key Information Extraction

Key Information Extraction (KIE) is aimed at extracting structured information (e.g. key-value pairs) from form-style documents (e.g. invoices), which makes an important step towards intelligent document understanding.

Papers

Showing 5174 of 74 papers

TitleStatusHype
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents0
Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models0
Information Extraction from Documents: Question Answering vs Token Classification in real-world setups0
Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections0
KIEval: Evaluation Metric for Document Key Information Extraction0
Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts0
LAPDoc: Layout-Aware Prompting for Documents0
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding0
NCU1415 at ROCLING 2022 Shared Task: A light-weight transformer-based approach for Biomedical Name Entity Recognition0
One-shot Key Information Extraction from Document with Deep Partial Graph Matching0
PDFVQA: A New Dataset for Real-World VQA on PDF Documents0
PPN: Parallel Pointer-based Network for Key Information Extraction with Complex Layouts0
PrIeD-KIE: Towards Privacy Preserved Document Key Information Extraction0
RDU: A Region-based Approach to Form-style Document Understanding0
RealKIE: Five Novel Datasets for Enterprise Key Information Extraction0
Relational Representation Learning in Visually-Rich Documents0
Comparison of biomedical relationship extraction methods and models for knowledge graph creation0
Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use0
SIMARA: a database for key-value information extraction from full pages0
UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-like Documents0
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents0
ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents0
VKIE: The Application of Key Information Extraction on Video Text0
"What is the value of templates?" Rethinking Document Information Extraction Datasets for LLMs0
Show:102550
← PrevPage 2 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RORE (GeoLayoutLM)F198.52Unverified
2GeoLayoutLMF197.97Unverified
3LayoutLMv3 LargeF197.46Unverified
4LayoutMask (large)F197.19Unverified
5LayoutMask (base)F196.99Unverified
6TPP (LayoutMask)F196.92Unverified
7LILTF196.07Unverified
8LayoutLMv2LARGEF196.01Unverified
9LayoutLMv2BASEF194.95Unverified
#ModelMetricClaimedVerifiedStatus
1LayoutLMv2LARGE (Excluding OCR mismatch)F197.81Unverified
2RORE (GeoLayoutLM)F196.97Unverified
3LayoutLMv2LARGEF196.61Unverified
4LayoutLMv2BASEF196.25Unverified
5ChatGPT 3.5 SpatialFormatAccuracy77Unverified
#ModelMetricClaimedVerifiedStatus
1LayoutLMv2LARGEF185.2Unverified
2LayoutLMv2BASEF183.3Unverified
3LAMBERT (75M)F180.42Unverified
#ModelMetricClaimedVerifiedStatus
1DANF1 (%)95.05Unverified