SOTAVerified

Document Layout Analysis

"Document Layout Analysis is performed to determine physical structure of a document, that is, to determine document components. These document components can consist of single connected components-regions [...] of pixels that are adjacent to form single regions [...] , or group of text lines. A text line is a group of characters, symbols, and words that are adjacent, “relatively close” to each other and through which a straight line can be drawn (usually with horizontal or vertical orientation)." L. O'Gorman, "The document spectrum for page layout analysis," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 11, pp. 1162-1173, Nov. 1993.


Papers

Showing 1-10 of 99 papers

| Title | Status | Hype |
|---|---|---|
| Class-Agnostic Region-of-Interest Matching in Document Images | Code | 0 |
| From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents | | 0 |
| SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation | | 0 |
| A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court | | 0 |
| Benchmarking Graph Neural Networks for Document Layout Analysis in Public Affairs | | 0 |
| AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization | | 0 |
| SFDLA: Source-Free Document Layout Analysis | Code | 0 |
| PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction | Code | 9 |
| UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis | Code | 2 |
| EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DoPTA | mAP | 70.72 | | Unverified |
| 2 | DocLayout-YOLO | mAP | 70.3 | | Unverified |
| 3 | VGT | mAP | 68.8 | | Unverified |