Document Layout Analysis

"Document Layout Analysis is performed to determine physical structure of a document, that is, to determine document components. These document components can consist of single connected components-regions [...] of pixels that are adjacent to form single regions [...] , or group of text lines. A text line is a group of characters, symbols, and words that are adjacent, “relatively close” to each other and through which a straight line can be drawn (usually with horizontal or vertical orientation)." L. O'Gorman, "The document spectrum for page layout analysis," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 11, pp. 1162-1173, Nov. 1993.

Image credit: PubLayNet: largest dataset ever for document layout analysis

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 99 papers

Title	Date	Tasks	Status	Hype	Score
appjsonify: An Academic Paper PDF-to-JSON Conversion Toolkit	Oct 2, 2023	Document Layout Analysis	CodeCode Available	1	5
Training data-efficient image transformers & distillation through attention	Dec 23, 2020	Document Image ClassificationDocument Layout Analysis	CodeCode Available	1	5
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents	Jul 12, 2024	Document Layout Analysisdocument understanding	CodeCode Available	1	5
DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond	Oct 19, 2023	Document AIDocument Layout Analysis	CodeCode Available	0	5
Vision Grid Transformer for Document Layout Analysis	Aug 29, 2023	Document AIDocument Layout Analysis	CodeCode Available	0	5
ICDAR 2021 Competition on Historical Map Segmentation	May 27, 2021	Contour DetectionDocument Layout Analysis	CodeCode Available	0	5
SFDLA: Source-Free Document Layout Analysis	Mar 24, 2025	AvgDocument Layout Analysis	CodeCode Available	0	5
Text Role Classification in Scientific Charts Using Multimodal Transformers	Feb 8, 2024	Data AugmentationDocument Layout Analysis	CodeCode Available	0	5
dhSegment: A generic deep-learning approach for document segmentation	Apr 27, 2018	Deep LearningDiversity	CodeCode Available	0	5
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking	Apr 18, 2022	cross-modal alignmentDocument AI	CodeCode Available	0	5
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset	Mar 9, 2023	BenchmarkingDeep Learning	CodeCode Available	0	5
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations	May 13, 2021	Document Layout AnalysisGraph Neural Network	CodeCode Available	0	5
A Graphical Approach to Document Layout Analysis	Aug 3, 2023	Document Layout AnalysisGraph Neural Network	CodeCode Available	0	5
Multimodal weighted graph representation for information extraction from visually rich documents.	Jan 5, 2024	Document Layout Analysisdocument understanding	CodeCode Available	0	5
Multi-Task Handwritten Document Layout Analysis	Jun 22, 2018	Document Layout Analysis	CodeCode Available	0	5
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis	May 15, 2023	ArticlesDocument Layout Analysis	CodeCode Available	0	5
Class-Agnostic Region-of-Interest Matching in Document Images	Jun 26, 2025	Document Layout Analysisdocument understanding	CodeCode Available	0	5
DCQA: Document-Level Chart Question Answering towards Complex Reasoning and Common-Sense Understanding	Oct 29, 2023	Answer GenerationChart Question Answering	CodeCode Available	0	5
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding	Dec 29, 2020	Document Image ClassificationDocument Layout Analysis	CodeCode Available	0	5
Information Extraction from Visually Rich Documents Using Directed Weighted Graph Neural Network	Sep 11, 2024	Document Layout Analysisdocument understanding	CodeCode Available	0	5
PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction	Sep 8, 2024	Deep LearningDocument Layout Analysis	CodeCode Available	0	5
Document Layout Annotation: Database and Benchmark in the Domain of Public Affairs	Jun 12, 2023	Document Layout Analysis	CodeCode Available	0	5
LayoutReader: Pre-training of Text and Layout for Reading Order Detection	Aug 26, 2021	Document Layout AnalysisOptical Character Recognition (OCR)	CodeCode Available	0	5
Vision-Based Layout Detection from Scientific Literature using Recurrent Convolutional Neural Networks	Oct 18, 2020	Document Layout Analysisobject-detection	—Unverified	0	0
Visual Detection with Context for Document Layout Analysis	Nov 1, 2019	ArticlesDocument Layout Analysis	—Unverified	0	0

Show:10 25 50

← PrevPage 2 of 4Next →

All datasets PubLayNet val U-DIADS-Bib D4LA Document Layout Recognition Challenge mini-dev Document Layout Recognition Challenge test RVL-CDIP

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CDeC-Net	Table	0.98	—	Unverified
2	VGT	Overall	0.96	—	Unverified
3	TRDLU	Overall	0.96	—	Unverified
4	VSR	Overall	0.96	—	Unverified
5	DETR	Overall	0.96	—	Unverified
6	LayoutLMv3-B	Overall	0.95	—	Unverified
7	DiT-L	Overall	0.95	—	Unverified
8	DoPTA	Overall	0.95	—	Unverified
9	UDoc	Overall	0.94	—	Unverified
10	ResNext-101-32×8d	Overall	0.94	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CV-Group	Class Average IoU	83.4	—	Unverified
2	CNKI	Class Average IoU	77.8	—	Unverified
3	VAI-OCR	Class Average IoU	70.7	—	Unverified
4	DeepLabV3+	Class Average IoU	66.5	—	Unverified
5	L3i++	Class Average IoU (Few-shot setting)	61.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DoPTA	mAP	70.72	—	Unverified
2	DocLayout-YOLO	mAP	70.3	—	Unverified
3	VGT	mAP	68.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Faster_RCNN	Overall	0.96	—	Unverified
2	fglihai	Overall	0.96	—	Unverified
3	Faster-RCNN	Overall	0.95	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	fglihai	Overall	0.92	—	Unverified
2	USYD NLP_CS29-2	Overall	0.92	—	Unverified
3	Faster-RCNN	Overall	0.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VisualWordGrid	FAR	28.7	—	Unverified