Document Layout Analysis

"Document Layout Analysis is performed to determine physical structure of a document, that is, to determine document components. These document components can consist of single connected components-regions [...] of pixels that are adjacent to form single regions [...], or group of text lines. A text line is a group of characters, symbols, and words that are adjacent, “relatively close” to each other and through which a straight line can be drawn (usually with horizontal or vertical orientation)."

Source: L. O'Gorman, "The document spectrum for page layout analysis," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 11, pp. 1162-1173, Nov. 1993.
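The notion of a text line in the definition above (components that are adjacent, "relatively close", and roughly collinear) can be illustrated with a small sketch. This is a simplified greedy heuristic for horizontal lines only, not the docstrum method from the cited paper. The function name `group_into_text_lines` and the thresholds `max_gap` and `min_overlap` are hypothetical choices made for illustration.

```python
from typing import List, Tuple

# A bounding box of one connected component: (x0, y0, x1, y1) in pixels.
Box = Tuple[int, int, int, int]


def group_into_text_lines(
    boxes: List[Box],
    max_gap: int = 15,          # assumed: largest horizontal gap still "relatively close"
    min_overlap: float = 0.5,   # assumed: required vertical overlap ratio
) -> List[List[Box]]:
    """Greedily chain boxes left to right into horizontal text lines."""
    lines: List[List[Box]] = []
    for box in sorted(boxes, key=lambda b: b[0]):
        for line in lines:
            last = line[-1]
            # Horizontal distance from the end of the line to the new box.
            gap = box[0] - last[2]
            # Vertical overlap, relative to the shorter of the two boxes.
            overlap = min(box[3], last[3]) - max(box[1], last[1])
            shorter = min(box[3] - box[1], last[3] - last[1])
            if gap <= max_gap and shorter > 0 and overlap / shorter >= min_overlap:
                line.append(box)
                break
        else:
            # No existing line accepts the box, so it starts a new line.
            lines.append([box])
    return lines


if __name__ == "__main__":
    sample = [(0, 0, 10, 10), (12, 1, 22, 11), (0, 30, 10, 40),
              (13, 29, 20, 41), (100, 0, 110, 10)]
    for text_line in group_into_text_lines(sample):
        print(text_line)
```

In the sample, the two boxes near the top left form one line, the two boxes below them form a second line, and the distant box at x = 100 starts a third line because its gap exceeds `max_gap`. Real pages with skew or vertical text would need a more careful approach.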


Papers


Showing 26–50 of 99 papers

Title	Date	Tasks	Status	Hype
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images	Aug 25, 2020	Document Layout Analysis, Table Detection	Code Available	1
DocBank: A Benchmark Dataset for Document Layout Analysis	Jun 1, 2020	Document Layout Analysis	Code Available	1
Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers	Feb 14, 2020	Document Layout Analysis, Semantic Segmentation	Code Available	1
Class-Agnostic Region-of-Interest Matching in Document Images	Jun 26, 2025	Document Layout Analysis, document understanding	Code Available	0
From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents	Jun 25, 2025	Document Layout Analysis, object-detection	Unverified	0
SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation	May 20, 2025	Document Layout Analysis, object-detection	Unverified	0
A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court	May 13, 2025	Diversity, Document Layout Analysis	Unverified	0
Benchmarking Graph Neural Networks for Document Layout Analysis in Public Affairs	May 12, 2025	Benchmarking, Document Layout Analysis	Unverified	0
AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization	Mar 28, 2025	Document Layout Analysis, object-detection	Unverified	0
SFDLA: Source-Free Document Layout Analysis	Mar 24, 2025	Avg, Document Layout Analysis	Code Available	0
EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation	Feb 23, 2025	Document Layout Analysis, Knowledge Distillation	Unverified	0
Graph-based Document Structure Analysis	Feb 4, 2025	Document Layout Analysis, Relation	Unverified	0
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning	Jan 1, 2025	Document Layout Analysis, Image Segmentation	Unverified	0
DoPTA: Improving Document Layout Analysis using Patch-Text Alignment	Dec 17, 2024	Document AI, Document Image Classification	Unverified	0
Information Extraction from Visually Rich Documents Using Directed Weighted Graph Neural Network	Sep 11, 2024	Document Layout Analysis, document understanding	Code Available	0
ICDAR 2024 Competition on Few-Shot and Many-Shot Layout Segmentation of Ancient Manuscripts (SAM)	Sep 11, 2024	Diversity, Document Layout Analysis	Unverified	0
PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction	Sep 8, 2024	Deep Learning, Document Layout Analysis	Code Available	0
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications	Jun 12, 2024	document-image-classification, Document Image Classification	Unverified	0
UnSupDLA: Towards Unsupervised Document Layout Analysis	Jun 10, 2024	Diversity, Document Layout Analysis	Unverified	0
Towards Unified Multi-granularity Text Detection with Interactive Attention	May 30, 2024	Document Layout Analysis, Optical Character Recognition (OCR)	Unverified	0
DLAFormer: An End-to-End Transformer For Document Layout Analysis	May 20, 2024	Document Layout Analysis, Document Summarization	Unverified	0
Callico: a Versatile Open-Source Document Image Annotation Platform	May 2, 2024	Document Layout Analysis, HTR	Unverified	0
A Hybrid Approach for Document Layout Analysis in Document images	Apr 27, 2024	Contrastive Learning, Decoder	Unverified	0
Text Role Classification in Scientific Charts Using Multimodal Transformers	Feb 8, 2024	Data Augmentation, Document Layout Analysis	Code Available	0
AutoIE: An Automated Framework for Information Extraction from Scientific Literature	Jan 30, 2024	Document Layout Analysis, Management	Unverified	0


Datasets: PubLayNet val, U-DIADS-Bib, D4LA, Document Layout Recognition Challenge mini-dev, Document Layout Recognition Challenge test, RVL-CDIP

Benchmark Results

Dataset: PubLayNet val
#	Model	Metric	Claimed	Verified	Status
1	CDeC-Net	Table	0.98	—	Unverified
2	VGT	Overall	0.96	—	Unverified
3	TRDLU	Overall	0.96	—	Unverified
4	DETR	Overall	0.96	—	Unverified
5	VSR	Overall	0.96	—	Unverified
6	LayoutLMv3-B	Overall	0.95	—	Unverified
7	DiT-L	Overall	0.95	—	Unverified
8	DoPTA	Overall	0.95	—	Unverified
9	UDoc	Overall	0.94	—	Unverified
10	ResNext-101-32×8d	Overall	0.94	—	Unverified

Dataset: U-DIADS-Bib
#	Model	Metric	Claimed	Verified	Status
1	CV-Group	Class Average IoU	83.4	—	Unverified
2	CNKI	Class Average IoU	77.8	—	Unverified
3	VAI-OCR	Class Average IoU	70.7	—	Unverified
4	DeepLabV3+	Class Average IoU	66.5	—	Unverified
5	L3i++	Class Average IoU (Few-shot setting)	61.1	—	Unverified
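For readers unfamiliar with the metric in this table, class-average IoU is commonly computed as the mean of per-class intersection-over-union scores between predicted and reference pixel labels. The sketch below shows that common definition on flat label sequences. It is an illustration only: the benchmark's exact protocol (for example, how absent classes or per-page averaging are handled) is not stated here and may differ, and the function name `class_average_iou` is hypothetical.

```python
from typing import List, Sequence


def class_average_iou(
    pred: Sequence[int],
    target: Sequence[int],
    num_classes: int,
) -> float:
    """Mean of per-class IoU over classes that occur in pred or target.

    Assumption: classes absent from both pred and target are skipped
    rather than counted as 0 or 1.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    ious: List[float] = []
    for c in range(num_classes):
        # Pixels labeled c in both maps.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels labeled c in at least one map.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    # Class 0: IoU = 1/2, class 1: IoU = 2/3, so the average is 7/12.
    score = class_average_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
    print(f"{100 * score:.1f}")  # scaled to 0-100, as in the table
```

The table reports values on a 0 to 100 scale, so a fraction returned by a function like this would be multiplied by 100 for comparison.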

Dataset: D4LA
#	Model	Metric	Claimed	Verified	Status
1	DoPTA	mAP	70.72	—	Unverified
2	DocLayout-YOLO	mAP	70.3	—	Unverified
3	VGT	mAP	68.8	—	Unverified

Dataset: Document Layout Recognition Challenge mini-dev
#	Model	Metric	Claimed	Verified	Status
1	Faster_RCNN	Overall	0.96	—	Unverified
2	fglihai	Overall	0.96	—	Unverified
3	Faster-RCNN	Overall	0.95	—	Unverified

Dataset: Document Layout Recognition Challenge test
#	Model	Metric	Claimed	Verified	Status
1	fglihai	Overall	0.92	—	Unverified
2	USYD NLP_CS29-2	Overall	0.92	—	Unverified
3	Faster-RCNN	Overall	0.91	—	Unverified

Dataset: RVL-CDIP
#	Model	Metric	Claimed	Verified	Status
1	VisualWordGrid	FAR	28.7	—	Unverified