PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction Mar 21, 2025 CPU Document Layout Analysis
Code Code Available 95 DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception Oct 16, 2024 Document Layout Analysis document understanding
Code Code Available 95 DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis Jun 2, 2022 Document Layout Analysis Object Detection
Code Code Available 85 A Large Dataset of Historical Japanese Documents with Complex Layouts Apr 18, 2020 Document Layout Analysis
Code Code Available 35 PubLayNet: largest dataset ever for document layout analysis Aug 16, 2019 Articles Document Layout Analysis
Code Code Available 25 UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis Mar 20, 2025 Document Layout Analysis Document Summarization
Code Code Available 25 BEiT: BERT Pre-Training of Image Transformers Jun 15, 2021 Document Image Classification Document Layout Analysis
Code Code Available 25 LayoutLM: Pre-training of Text and Layout for Document Image Understanding Dec 31, 2019 Document AI document-image-classification
Code Code Available 25 Towards End-to-End Unified Scene Text Detection and Layout Analysis Mar 28, 2022 Document Layout Analysis Scene Text Detection
Code Code Available 25 Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis Jan 22, 2024 Document Layout Analysis Document Summarization
Code Code Available 25 CTE: A Dataset for Contextualized Table Extraction Feb 2, 2023 Document Layout Analysis Table Detection
Code Code Available 15 DiT: Self-supervised Pre-training for Document Image Transformer Mar 4, 2022 Document AI document-image-classification
Code Code Available 15 Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks Aug 23, 2022 Document Layout Analysis document understanding
Code Code Available 15 DocBank: A Benchmark Dataset for Document Layout Analysis Jun 1, 2020 Document Layout Analysis
Code Code Available 15 CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images Aug 25, 2020 Document Layout Analysis Table Detection
Code Code Available 15 docExtractor: An off-the-shelf historical document element extraction Dec 15, 2020 Document Layout Analysis Segmentation
Code Code Available 15 Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis Aug 22, 2022 Component Classification Document Layout Analysis
Code Code Available 15 M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis Jan 1, 2023 Articles Document Layout Analysis
Code Code Available 15 DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer Jan 27, 2022 Decision Making Document Layout Analysis
Code Code Available 15 PARAGRAPH2GRAPH: A GNN-based framework for layout paragraph analysis Apr 24, 2023 Document Layout Analysis Graph Neural Network
Code Code Available 15 DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis Jul 6, 2021 Document Layout Analysis Image Generation
Code Code Available 15 Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout Analysis Aug 29, 2023 Document AI Document Layout Analysis
Code Code Available 15 RoDLA: Benchmarking the Robustness of Document Layout Analysis Models Mar 21, 2024 Benchmarking Document Layout Analysis
Code Code Available 15 Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers Feb 14, 2020 Document Layout Analysis Semantic Segmentation
Code Code Available 15 SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation May 1, 2023 Document Layout Analysis object-detection
Code Code Available 15 appjsonify: An Academic Paper PDF-to-JSON Conversion Toolkit Oct 2, 2023 Document Layout Analysis
Code Code Available 15 Training data-efficient image transformers & distillation through attention Dec 23, 2020 Document Image Classification Document Layout Analysis
Code Code Available 15 DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents Jul 12, 2024 Document Layout Analysis document understanding
Code Code Available 15 DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond Oct 19, 2023 Document AI Document Layout Analysis
Code Code Available 05 Vision Grid Transformer for Document Layout Analysis Aug 29, 2023 Document AI Document Layout Analysis
Code Code Available 05 ICDAR 2021 Competition on Historical Map Segmentation May 27, 2021 Contour Detection Document Layout Analysis
Code Code Available 05 SFDLA: Source-Free Document Layout Analysis Mar 24, 2025 Avg Document Layout Analysis
Code Code Available 05 Text Role Classification in Scientific Charts Using Multimodal Transformers Feb 8, 2024 Data Augmentation Document Layout Analysis
Code Code Available 05 dhSegment: A generic deep-learning approach for document segmentation Apr 27, 2018 Deep Learning Diversity
Code Code Available 05 LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking Apr 18, 2022 cross-modal alignment Document AI
Code Code Available 05 BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset Mar 9, 2023 Benchmarking Deep Learning
Code Code Available 05 VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations May 13, 2021 Document Layout Analysis Graph Neural Network
Code Code Available 05 A Graphical Approach to Document Layout Analysis Aug 3, 2023 Document Layout Analysis Graph Neural Network
Code Code Available 05 Multimodal weighted graph representation for information extraction from visually rich documents. Jan 5, 2024 Document Layout Analysis document understanding
Code Code Available 05 Multi-Task Handwritten Document Layout Analysis Jun 22, 2018 Document Layout Analysis
Code Code Available 05 M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis May 15, 2023 Articles Document Layout Analysis
Code Code Available 05 Class-Agnostic Region-of-Interest Matching in Document Images Jun 26, 2025 Document Layout Analysis document understanding
Code Code Available 05 DCQA: Document-Level Chart Question Answering towards Complex Reasoning and Common-Sense Understanding Oct 29, 2023 Answer Generation Chart Question Answering
Code Code Available 05 LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding Dec 29, 2020 Document Image Classification Document Layout Analysis
Code Code Available 05 Information Extraction from Visually Rich Documents Using Directed Weighted Graph Neural Network Sep 11, 2024 Document Layout Analysis document understanding
Code Code Available 05 PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction Sep 8, 2024 Deep Learning Document Layout Analysis
Code Code Available 05 Document Layout Annotation: Database and Benchmark in the Domain of Public Affairs Jun 12, 2023 Document Layout Analysis
Code Code Available 05 LayoutReader: Pre-training of Text and Layout for Reading Order Detection Aug 26, 2021 Document Layout Analysis Optical Character Recognition (OCR)
Code Code Available 05 Vision-Based Layout Detection from Scientific Literature using Recurrent Convolutional Neural Networks Oct 18, 2020 Document Layout Analysis object-detection
— Unverified 00 Visual Detection with Context for Document Layout Analysis Nov 1, 2019 Articles Document Layout Analysis
— Unverified 00