SOTAVerified

Document Image Classification

Document image classification is the task of classifying documents based on images of their contents.

( Image credit: Real-Time Document Image Classification using Deep CNN and Extreme Learning Machines )

Papers

Showing 2650 of 50 papers

TitleStatusHype
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding0
EAML: Ensemble Self-Attention-based Mutual Learning Network for Document Image Classification0
Evaluating Adversarial Robustness on Document Image Classification0
Context-Aware Classification of Legal Document Pages0
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-trainingCode0
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification0
LayoutLMv3: Pre-training for Document AI with Unified Text and Image MaskingCode0
Document AI: Benchmarks, Models and Applications0
Domain Agnostic Few-Shot Learning For Document Intelligence0
Efficient Document Image Classification Using Region-Based Graph Neural Network0
Toward Automatic Interpretation of 3D Plots0
StructuralLM: Structural Pre-training for Form UnderstandingCode0
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document UnderstandingCode0
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document UnderstandingCode0
Visual and Textual Deep Feature Fusion for Document Image Classification0
Self-Supervised Representation Learning on Document Images0
Light-Weighted CNN for Text ClassificationCode0
PCGAN-CHAR: Progressively Trained Classifier Generative Adversarial Networks for Classification of Noisy Handwritten Bangla Characters0
Pixel-level Reconstruction and Classification for Noisy Handwritten Bangla Characters0
Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural NetworksCode0
Real-Time Document Image Classification using Deep CNN and Extreme Learning Machines0
Analysis of Convolutional Neural Networks for Document Image Classification0
Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image ClassificationCode0
Document image classification, with a specific view on applications of patent images0
Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval0
Show:102550
← PrevPage 2 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EAMLAccuracy97.7Unverified
2Cross-ModalAccuracy97.05Unverified
3DocFormerBASEAccuracy96.17Unverified
4LayoutLMV3LargeAccuracy95.93Unverified
5LiLT[EN-R]BASEAccuracy95.68Unverified
6LayoutLMv2LARGEAccuracy95.64Unverified
7TILT-LargeAccuracy95.52Unverified
8DocFormer largeAccuracy95.5Unverified
9LayoutLMv3BASEAccuracy95.44Unverified
10DonutAccuracy95.3Unverified
#ModelMetricClaimedVerifiedStatus
1DocXClassifier-LAccuracy95.57Unverified
2DocBert [DOCBERT]Accuracy91.95Unverified
3Eff-GNN + Word2Vec [word2vec]Accuracy91Unverified
4Multimodal Side-Tuning (MobileNetV2)Accuracy90.5Unverified
5Multimodal Side-Tuning (ResNet50)Accuracy90.3Unverified
6DocBERT [DOCBERT]Accuracy82.3Unverified
7BERT [BERT]Accuracy79Unverified
8Eff-GNN + Word2Vec [word2vec] + Image EmbeddingAccuracy77.5Unverified
9Eff-GNN+ Word2Vec [word2vec]Accuracy73.5Unverified
10VGGMemory7.08Unverified
#ModelMetricClaimedVerifiedStatus
1PCGAN-CHARAccuracy89.54Unverified
2Pixel-level RCAccuracy77.22Unverified
#ModelMetricClaimedVerifiedStatus
1PCGAN-CHARAccuracy96.68Unverified
2Pixel-level RCAccuracy95.46Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-RS (ResNet-200 + RS training tricks)Top 1 Accuracy - Verb83.4Unverified
#ModelMetricClaimedVerifiedStatus
1Pixel-level RCAccuracy97.62Unverified
#ModelMetricClaimedVerifiedStatus
1PCGAN-CHARAccuracy98.43Unverified
#ModelMetricClaimedVerifiedStatus
1CNNAccuracy86Unverified