PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction | Mar 21, 2025 | CPU, Document Layout Analysis | Code Available | 9
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception | Oct 16, 2024 | Document Layout Analysis, document understanding | Code Available | 9
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis | Jun 2, 2022 | Document Layout Analysis, Object Detection | Code Available | 8
A Large Dataset of Historical Japanese Documents with Complex Layouts | Apr 18, 2020 | Document Layout Analysis | Code Available | 3
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis | Mar 20, 2025 | Document Layout Analysis, Document Summarization | Code Available | 2
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis | Jan 22, 2024 | Document Layout Analysis, Document Summarization | Code Available | 2
Towards End-to-End Unified Scene Text Detection and Layout Analysis | Mar 28, 2022 | Document Layout Analysis, Scene Text Detection | Code Available | 2
BEiT: BERT Pre-Training of Image Transformers | Jun 15, 2021 | Document Image Classification, Document Layout Analysis | Code Available | 2
LayoutLM: Pre-training of Text and Layout for Document Image Understanding | Dec 31, 2019 | Document AI, document-image-classification | Code Available | 2
PubLayNet: largest dataset ever for document layout analysis | Aug 16, 2019 | Articles, Document Layout Analysis | Code Available | 2
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysis, document understanding | Code Available | 1
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models | Mar 21, 2024 | Benchmarking, Document Layout Analysis | Code Available | 1
appjsonify: An Academic Paper PDF-to-JSON Conversion Toolkit | Oct 2, 2023 | Document Layout Analysis | Code Available | 1
Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout Analysis | Aug 29, 2023 | Document AI, Document Layout Analysis | Code Available | 1
SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation | May 1, 2023 | Document Layout Analysis, object-detection | Code Available | 1
PARAGRAPH2GRAPH: A GNN-based framework for layout paragraph analysis | Apr 24, 2023 | Document Layout Analysis, Graph Neural Network | Code Available | 1
CTE: A Dataset for Contextualized Table Extraction | Feb 2, 2023 | Document Layout Analysis, Table Detection | Code Available | 1
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis | Jan 1, 2023 | Articles, Document Layout Analysis | Code Available | 1
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks | Aug 23, 2022 | Document Layout Analysis, document understanding | Code Available | 1
Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis | Aug 22, 2022 | Component Classification, Document Layout Analysis | Code Available | 1
DiT: Self-supervised Pre-training for Document Image Transformer | Mar 4, 2022 | Document AI, document-image-classification | Code Available | 1
DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer | Jan 27, 2022 | Decision Making, Document Layout Analysis | Code Available | 1
DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis | Jul 6, 2021 | Document Layout Analysis, Image Generation | Code Available | 1
Training data-efficient image transformers & distillation through attention | Dec 23, 2020 | Document Image Classification, Document Layout Analysis | Code Available | 1
docExtractor: An off-the-shelf historical document element extraction | Dec 15, 2020 | Document Layout Analysis, Segmentation | Code Available | 1
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images | Aug 25, 2020 | Document Layout Analysis, Table Detection | Code Available | 1
DocBank: A Benchmark Dataset for Document Layout Analysis | Jun 1, 2020 | Document Layout Analysis | Code Available | 1
Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers | Feb 14, 2020 | Document Layout Analysis, Semantic Segmentation | Code Available | 1
Class-Agnostic Region-of-Interest Matching in Document Images | Jun 26, 2025 | Document Layout Analysis, document understanding | Code Available | 0
From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents | Jun 25, 2025 | Document Layout Analysis, object-detection | Unverified | 0
SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation | May 20, 2025 | Document Layout Analysis, object-detection | Unverified | 0
A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court | May 13, 2025 | Diversity, Document Layout Analysis | Unverified | 0
Benchmarking Graph Neural Networks for Document Layout Analysis in Public Affairs | May 12, 2025 | Benchmarking, Document Layout Analysis | Unverified | 0
AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization | Mar 28, 2025 | Document Layout Analysis, object-detection | Unverified | 0
SFDLA: Source-Free Document Layout Analysis | Mar 24, 2025 | Avg, Document Layout Analysis | Code Available | 0
EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation | Feb 23, 2025 | Document Layout Analysis, Knowledge Distillation | Unverified | 0
Graph-based Document Structure Analysis | Feb 4, 2025 | Document Layout Analysis, Relation | Unverified | 0
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning | Jan 1, 2025 | Document Layout Analysis, Image Segmentation | Unverified | 0
DoPTA: Improving Document Layout Analysis using Patch-Text Alignment | Dec 17, 2024 | Document AI, Document Image Classification | Unverified | 0
Information Extraction from Visually Rich Documents Using Directed Weighted Graph Neural Network | Sep 11, 2024 | Document Layout Analysis, document understanding | Code Available | 0
ICDAR 2024 Competition on Few-Shot and Many-Shot Layout Segmentation of Ancient Manuscripts (SAM) | Sep 11, 2024 | Diversity, Document Layout Analysis | Unverified | 0
PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction | Sep 8, 2024 | Deep Learning, Document Layout Analysis | Code Available | 0
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | Jun 12, 2024 | document-image-classification, Document Image Classification | Unverified | 0
UnSupDLA: Towards Unsupervised Document Layout Analysis | Jun 10, 2024 | Diversity, Document Layout Analysis | Unverified | 0
Towards Unified Multi-granularity Text Detection with Interactive Attention | May 30, 2024 | Document Layout Analysis, Optical Character Recognition (OCR) | Unverified | 0
DLAFormer: An End-to-End Transformer For Document Layout Analysis | May 20, 2024 | Document Layout Analysis, Document Summarization | Unverified | 0
Callico: a Versatile Open-Source Document Image Annotation Platform | May 2, 2024 | Document Layout Analysis, HTR | Unverified | 0
A Hybrid Approach for Document Layout Analysis in Document images | Apr 27, 2024 | Contrastive Learning, Decoder | Unverified | 0
Text Role Classification in Scientific Charts Using Multimodal Transformers | Feb 8, 2024 | Data Augmentation, Document Layout Analysis | Code Available | 0
AutoIE: An Automated Framework for Information Extraction from Scientific Literature | Jan 30, 2024 | Document Layout Analysis, Management | Unverified | 0