TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document Mar 7, 2024 document understanding Key Information Extraction
Code Code Available 5OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models May 13, 2023 Key Information Extraction Nutrition
Code Code Available 2LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding Feb 28, 2022 Document Image Classification document understanding
Code Code Available 2LayoutLM: Pre-training of Text and Layout for Document Image Understanding Dec 31, 2019 Document AI document-image-classification
Code Code Available 2A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding Jul 2, 2024 document understanding Key Information Extraction
Code Code Available 2PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction Jan 7, 2024 Key Information Extraction Key-value Pair Extraction
Code Code Available 1Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks Aug 23, 2022 Document Layout Analysis document understanding
Code Code Available 1ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding Oct 12, 2022 document-image-classification Document Image Classification
Code Code Available 1KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents May 1, 2024 Diversity Key Information Extraction
Code Code Available 1Key Information Extraction From Documents: Evaluation And Generator Jun 9, 2021 Decoder Key Information Extraction
Code Code Available 1DocILE Benchmark for Document Information Localization and Extraction Feb 11, 2023 Key Information Extraction Unsupervised Pre-training
Code Code Available 1PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks Apr 16, 2020 Graph Learning Key Information Extraction
Code Code Available 1Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction Oct 17, 2023 Entity Linking Key Information Extraction
Code Code Available 1BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents Aug 10, 2021 Key Information Extraction Language Modeling
Code Code Available 1LAMBERT: Layout-Aware (Language) Modeling for information extraction Feb 19, 2020 Key Information Extraction Language Modeling
Code Code Available 1Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation Oct 25, 2023 Handwritten Text Recognition Key Information Extraction
Code Code Available 1Form-NLU: Dataset for the Form Natural Language Understanding Apr 4, 2023 4k Form
Code Code Available 1GenKIE: Robust Generative Multimodal Document Key Information Extraction Oct 24, 2023 Decoder Key Information Extraction
Code Code Available 1Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding Sep 29, 2024 document understanding Entity Linking
Code Code Available 1Entity Relation Extraction as Dependency Parsing in Visually Rich Documents Oct 19, 2021 Dependency Parsing Entity Linking
— Unverified 0A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents Apr 16, 2024 document understanding Key Information Extraction
— Unverified 0CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy Dec 3, 2024 Hallucination Key Information Extraction
— Unverified 0Construction of a Syntactic Analysis Map for Yi Shui School through Text Mining and Natural Language Processing Research Feb 16, 2024 graph construction Information Retrieval
— Unverified 0Data Efficient Training of a U-Net Based Architecture for Structured Documents Localization Oct 2, 2023 Decoder Deep Learning
— Unverified 0Deep Learning based Key Information Extraction from Business Documents: Systematic Literature Review Jul 23, 2024 Deep Learning document understanding
— Unverified 0DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency Nov 9, 2023 document understanding Key Information Extraction
— Unverified 0DUBLIN -- Document Understanding By Language-Image Network May 23, 2023 Document Classification document understanding
— Unverified 0Emergency Communication: OTFS-Based Semantic Transmission with Diffusion Noise Suppression Apr 10, 2025 Denoising Key Information Extraction
— Unverified 0End-to-End Document Classification and Key Information Extraction using Assignment Optimization Jun 1, 2023 Classification Document Classification
— Unverified 0Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models Apr 27, 2025 Key Information Extraction Natural Language Understanding
— Unverified 0Information Extraction from Documents: Question Answering vs Token Classification in real-world setups Apr 21, 2023 Classification Few-Shot Learning
— Unverified 0Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections Oct 7, 2022 Key Information Extraction Line Detection
— Unverified 0KIEval: Evaluation Metric for Document Key Information Extraction Mar 7, 2025 Key Information Extraction
— Unverified 0Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts May 12, 2021 Key Information Extraction
— Unverified 0LAPDoc: Layout-Aware Prompting for Documents Feb 15, 2024 document understanding Key Information Extraction
— Unverified 0LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding May 30, 2023 document-image-classification Document Image Classification
— Unverified 0NCU1415 at ROCLING 2022 Shared Task: A light-weight transformer-based approach for Biomedical Name Entity Recognition Nov 1, 2022 Key Information Extraction NER
— Unverified 0One-shot Key Information Extraction from Document with Deep Partial Graph Matching Sep 26, 2021 Graph Matching Key Information Extraction
— Unverified 0PDFVQA: A New Dataset for Real-World VQA on PDF Documents Apr 13, 2023 document understanding Key Information Extraction
— Unverified 0PPN: Parallel Pointer-based Network for Key Information Extraction with Complex Layouts Jul 20, 2023 Key Information Extraction
— Unverified 0PrIeD-KIE: Towards Privacy Preserved Document Key Information Extraction Oct 5, 2023 Document AI Federated Learning
— Unverified 0RDU: A Region-based Approach to Form-style Document Understanding Jun 14, 2022 document understanding Form
— Unverified 0RealKIE: Five Novel Datasets for Enterprise Key Information Extraction Mar 29, 2024 Key Information Extraction Optical Character Recognition (OCR)
— Unverified 0Relational Representation Learning in Visually-Rich Documents May 5, 2022 Contrastive Learning Key Information Extraction
— Unverified 0Comparison of biomedical relationship extraction methods and models for knowledge graph creation Jan 5, 2022 Key Information Extraction Knowledge Graphs
— Unverified 0Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use May 30, 2024 document understanding Key Information Extraction
— Unverified 0See then Tell: Enhancing Key Information Extraction with Vision Grounding Sep 29, 2024 Image to text Key Information Extraction
— Unverified 0SIMARA: a database for key-value information extraction from full pages Apr 26, 2023 Handwriting Recognition Handwritten Text Recognition
— Unverified 0UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-like Documents Jan 17, 2024 Decoder Form
— Unverified 0ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents May 25, 2021 Image Segmentation Key Information Extraction
— Unverified 0