SOTAVerified

Document Image Classification

Document image classification is the task of classifying documents based on images of their contents.

(Image credit: Real-Time Document Image Classification using Deep CNN and Extreme Learning Machines)

Papers

Showing 1-10 of 50 papers

| Title | Status | Hype |
|---|---|---|
| Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models | | 0 |
| DoPTA: Improving Document Layout Analysis using Patch-Text Alignment | | 0 |
| DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification | | 0 |
| DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | | 0 |
| Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting | Code | 0 |
| CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification | | 0 |
| LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding | | 0 |
| Automatic Recognition of Learning Resource Category in a Digital Library | Code | 0 |
| SUT: a new multi-purpose synthetic dataset for Farsi document image analysis | Code | 0 |
| A Multi-Modal Multilingual Benchmark for Document Image Classification | | 0 |

Benchmark Results

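| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | EAML | Accuracy | 97.7 | | Unverified |
| 2 | Cross-Modal | Accuracy | 97.05 | | Unverified |
| 3 | DocFormer BASE | Accuracy | 96.17 | | Unverified |
| 4 | LayoutLMV3 Large | Accuracy | 95.93 | | Unverified |
| 5 | LiLT[EN-R] BASE | Accuracy | 95.68 | | Unverified |
| 6 | LayoutLMv2 LARGE | Accuracy | 95.64 | | Unverified |
| 7 | TILT-Large | Accuracy | 95.52 | | Unverified |
| 8 | DocFormer large | Accuracy | 95.5 | | Unverified |
| 9 | LayoutLMv3 BASE | Accuracy | 95.44 | | Unverified |
| 10 | Donut | Accuracy | 95.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DocXClassifier-L | Accuracy | 95.57 | | Unverified |
| 2 | DocBERT [DOCBERT] | Accuracy | 91.95 | | Unverified |
| 3 | Eff-GNN + Word2Vec [word2vec] | Accuracy | 91 | | Unverified |
| 4 | Multimodal Side-Tuning (MobileNetV2) | Accuracy | 90.5 | | Unverified |
| 5 | Multimodal Side-Tuning (ResNet50) | Accuracy | 90.3 | | Unverified |
| 6 | DocBERT [DOCBERT] | Accuracy | 82.3 | | Unverified |
| 7 | BERT [BERT] | Accuracy | 79 | | Unverified |
| 8 | Eff-GNN + Word2Vec [word2vec] + Image Embedding | Accuracy | 77.5 | | Unverified |
| 9 | Eff-GNN + Word2Vec [word2vec] | Accuracy | 73.5 | | Unverified |
| 10 | VGG | Memory | 7.08 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PCGAN-CHAR | Accuracy | 89.54 | | Unverified |
| 2 | Pixel-level RC | Accuracy | 77.22 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PCGAN-CHAR | Accuracy | 96.68 | | Unverified |
| 2 | Pixel-level RC | Accuracy | 95.46 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ResNet-RS (ResNet-200 + RS training tricks) | Top 1 Accuracy - Verb | 83.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Pixel-level RC | Accuracy | 97.62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PCGAN-CHAR | Accuracy | 98.43 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CNN | Accuracy | 86 | | Unverified |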