Key Information Extraction

Key Information Extraction (KIE) aims to extract structured information (e.g. key-value pairs) from form-style documents (e.g. invoices), and is an important step towards intelligent document understanding.

Papers

Showing 1-50 of 74 papers

| Title | Status | Hype |
|---|---|---|
| TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document | Code | 5 |
| OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models | Code | 2 |
| LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding | Code | 2 |
| LayoutLM: Pre-training of Text and Layout for Document Image Understanding | Code | 2 |
| A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding | Code | 2 |
| PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction | Code | 1 |
| Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks | Code | 1 |
| ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding | Code | 1 |
| KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents | Code | 1 |
| Key Information Extraction From Documents: Evaluation And Generator | Code | 1 |
| DocILE Benchmark for Document Information Localization and Extraction | Code | 1 |
| PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks | Code | 1 |
| Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction | Code | 1 |
| BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents | Code | 1 |
| LAMBERT: Layout-Aware (Language) Modeling for information extraction | Code | 1 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Code | 1 |
| Form-NLU: Dataset for the Form Natural Language Understanding | Code | 1 |
| GenKIE: Robust Generative Multimodal Document Key Information Extraction | Code | 1 |
| Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding | Code | 1 |
| Entity Relation Extraction as Dependency Parsing in Visually Rich Documents | | 0 |
| A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents | | 0 |
| CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy | | 0 |
| Construction of a Syntactic Analysis Map for Yi Shui School through Text Mining and Natural Language Processing Research | | 0 |
| Data Efficient Training of a U-Net Based Architecture for Structured Documents Localization | | 0 |
| Deep Learning based Key Information Extraction from Business Documents: Systematic Literature Review | | 0 |
| DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency | | 0 |
| DUBLIN -- Document Understanding By Language-Image Network | | 0 |
| Emergency Communication: OTFS-Based Semantic Transmission with Diffusion Noise Suppression | | 0 |
| End-to-End Document Classification and Key Information Extraction using Assignment Optimization | | 0 |
| Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models | | 0 |
| Information Extraction from Documents: Question Answering vs Token Classification in real-world setups | | 0 |
| Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections | | 0 |
| KIEval: Evaluation Metric for Document Key Information Extraction | | 0 |
| Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts | | 0 |
| LAPDoc: Layout-Aware Prompting for Documents | | 0 |
| LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding | | 0 |
| NCU1415 at ROCLING 2022 Shared Task: A light-weight transformer-based approach for Biomedical Name Entity Recognition | | 0 |
| One-shot Key Information Extraction from Document with Deep Partial Graph Matching | | 0 |
| PDFVQA: A New Dataset for Real-World VQA on PDF Documents | | 0 |
| PPN: Parallel Pointer-based Network for Key Information Extraction with Complex Layouts | | 0 |
| PrIeD-KIE: Towards Privacy Preserved Document Key Information Extraction | | 0 |
| RDU: A Region-based Approach to Form-style Document Understanding | | 0 |
| RealKIE: Five Novel Datasets for Enterprise Key Information Extraction | | 0 |
| Relational Representation Learning in Visually-Rich Documents | | 0 |
| Comparison of biomedical relationship extraction methods and models for knowledge graph creation | | 0 |
| Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use | | 0 |
| See then Tell: Enhancing Key Information Extraction with Vision Grounding | | 0 |
| SIMARA: a database for key-value information extraction from full pages | | 0 |
| UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-like Documents | | 0 |
| ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents | | 0 |

Benchmark Results

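| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | RORE (GeoLayoutLM) | F1 | 98.52 | | Unverified |
| 2 | GeoLayoutLM | F1 | 97.97 | | Unverified |
| 3 | LayoutLMv3 Large | F1 | 97.46 | | Unverified |
| 4 | LayoutMask (large) | F1 | 97.19 | | Unverified |
| 5 | LayoutMask (base) | F1 | 96.99 | | Unverified |
| 6 | TPP (LayoutMask) | F1 | 96.92 | | Unverified |
| 7 | LILT | F1 | 96.07 | | Unverified |
| 8 | LayoutLMv2LARGE | F1 | 96.01 | | Unverified |
| 9 | LayoutLMv2BASE | F1 | 94.95 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LayoutLMv2LARGE (Excluding OCR mismatch) | F1 | 97.81 | | Unverified |
| 2 | RORE (GeoLayoutLM) | F1 | 96.97 | | Unverified |
| 3 | LayoutLMv2LARGE | F1 | 96.61 | | Unverified |
| 4 | LayoutLMv2BASE | F1 | 96.25 | | Unverified |
| 5 | ChatGPT 3.5 SpatialFormat | Accuracy | 77 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LayoutLMv2LARGE | F1 | 85.2 | | Unverified |
| 2 | LayoutLMv2BASE | F1 | 83.3 | | Unverified |
| 3 | LAMBERT (75M) | F1 | 80.42 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DAN | F1 (%) | 95.05 | | Unverified |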