SOTAVerified

Document Image Classification

Document image classification is the task of classifying documents based on images of their contents.

( Image credit: Real-Time Document Image Classification using Deep CNN and Extreme Learning Machines )

Papers

Showing 110 of 50 papers

TitleStatusHype
OCR-free Document Understanding TransformerCode3
LayoutLM: Pre-training of Text and Layout for Document Image UnderstandingCode2
BEiT: BERT Pre-Training of Image TransformersCode2
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document UnderstandingCode2
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout TransformerCode1
Improving accuracy and speeding up Document Image Classification through parallel systemsCode1
DocFormer: End-to-End Transformer for Document UnderstandingCode1
DiT: Self-supervised Pre-training for Document Image TransformerCode1
DocXClassifier: High Performance Explainable Deep Network for Document Image ClassificationCode1
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document UnderstandingCode1
Show:102550
← PrevPage 1 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EAMLAccuracy97.7Unverified
2Cross-ModalAccuracy97.05Unverified
3DocFormerBASEAccuracy96.17Unverified
4LayoutLMV3LargeAccuracy95.93Unverified
5LiLT[EN-R]BASEAccuracy95.68Unverified
6LayoutLMv2LARGEAccuracy95.64Unverified
7TILT-LargeAccuracy95.52Unverified
8DocFormer largeAccuracy95.5Unverified
9LayoutLMv3BASEAccuracy95.44Unverified
10DonutAccuracy95.3Unverified
#ModelMetricClaimedVerifiedStatus
1DocXClassifier-LAccuracy95.57Unverified
2DocBert [DOCBERT]Accuracy91.95Unverified
3Eff-GNN + Word2Vec [word2vec]Accuracy91Unverified
4Multimodal Side-Tuning (MobileNetV2)Accuracy90.5Unverified
5Multimodal Side-Tuning (ResNet50)Accuracy90.3Unverified
6DocBERT [DOCBERT]Accuracy82.3Unverified
7BERT [BERT]Accuracy79Unverified
8Eff-GNN + Word2Vec [word2vec] + Image EmbeddingAccuracy77.5Unverified
9Eff-GNN+ Word2Vec [word2vec]Accuracy73.5Unverified
10VGGMemory7.08Unverified
#ModelMetricClaimedVerifiedStatus
1PCGAN-CHARAccuracy89.54Unverified
2Pixel-level RCAccuracy77.22Unverified
#ModelMetricClaimedVerifiedStatus
1PCGAN-CHARAccuracy96.68Unverified
2Pixel-level RCAccuracy95.46Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-RS (ResNet-200 + RS training tricks)Top 1 Accuracy - Verb83.4Unverified
#ModelMetricClaimedVerifiedStatus
1Pixel-level RCAccuracy97.62Unverified
#ModelMetricClaimedVerifiedStatus
1PCGAN-CHARAccuracy98.43Unverified
#ModelMetricClaimedVerifiedStatus
1CNNAccuracy86Unverified