| OCR-free Document Understanding Transformer | Nov 30, 2021 | Document Image Classificationdocument understanding | CodeCode Available | 3 |
| LayoutLM: Pre-training of Text and Layout for Document Image Understanding | Dec 31, 2019 | Document AIdocument-image-classification | CodeCode Available | 2 |
| BEiT: BERT Pre-Training of Image Transformers | Jun 15, 2021 | Document Image ClassificationDocument Layout Analysis | CodeCode Available | 2 |
| LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding | Feb 28, 2022 | Document Image Classificationdocument understanding | CodeCode Available | 2 |
| Revisiting ResNets: Improved Training and Scaling Strategies | Mar 13, 2021 | Action ClassificationDocument Image Classification | CodeCode Available | 1 |
| DocXClassifier: High Performance Explainable Deep Network for Document Image Classification | Mar 17, 2022 | ClassificationData Augmentation | CodeCode Available | 1 |
| Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer | Feb 18, 2021 | DecoderDocument Image Classification | CodeCode Available | 1 |
| Improving accuracy and speeding up Document Image Classification through parallel systems | Jun 16, 2020 | Document Classificationdocument-image-classification | CodeCode Available | 1 |
| Training data-efficient image transformers & distillation through attention | Dec 23, 2020 | Document Image ClassificationDocument Layout Analysis | CodeCode Available | 1 |
| Multimodal Side-Tuning for Document Classification | Jan 16, 2023 | ClassificationDocument Classification | CodeCode Available | 1 |
| DiT: Self-supervised Pre-training for Document Image Transformer | Mar 4, 2022 | Document AIdocument-image-classification | CodeCode Available | 1 |
| DocFormer: End-to-End Transformer for Document Understanding | Jun 22, 2021 | Document Image Classificationdocument understanding | CodeCode Available | 1 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Jul 26, 2019 | Common Sense ReasoningDocument Image Classification | CodeCode Available | 1 |
| ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding | Oct 12, 2022 | document-image-classificationDocument Image Classification | CodeCode Available | 1 |
| LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding | Mar 21, 2024 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding | May 30, 2023 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Real-Time Document Image Classification using Deep CNN and Extreme Learning Machines | Nov 3, 2017 | ClassificationDeep Learning | —Unverified | 0 |
| Analysis of Convolutional Neural Networks for Document Image Classification | Aug 10, 2017 | ClassificationData Augmentation | —Unverified | 0 |
| CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification | May 6, 2024 | Document Classificationdocument-image-classification | —Unverified | 0 |
| Context-Aware Classification of Legal Document Pages | Apr 5, 2023 | Classificationdocument-image-classification | —Unverified | 0 |
| DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | Jun 12, 2024 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Document AI: Benchmarks, Models and Applications | Nov 16, 2021 | Deep LearningDocument AI | —Unverified | 0 |
| Document image classification, with a specific view on applications of patent images | Jan 13, 2016 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification | Jul 4, 2024 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Domain Agnostic Few-Shot Learning For Document Intelligence | Oct 29, 2021 | ClassificationCross-Domain Few-Shot | —Unverified | 0 |
| DoPTA: Improving Document Layout Analysis using Patch-Text Alignment | Dec 17, 2024 | Document AIDocument Image Classification | —Unverified | 0 |
| EAML: Ensemble Self-Attention-based Mutual Learning Network for Document Image Classification | May 11, 2023 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Efficient Document Image Classification Using Region-Based Graph Neural Network | Jun 25, 2021 | ClassificationDocument Classification | —Unverified | 0 |
| Evaluating Adversarial Robustness on Document Image Classification | Apr 24, 2023 | Adversarial AttackAdversarial Robustness | —Unverified | 0 |
| Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval | Feb 25, 2015 | Descriptivedocument-image-classification | —Unverified | 0 |
| PCGAN-CHAR: Progressively Trained Classifier Generative Adversarial Networks for Classification of Noisy Handwritten Bangla Characters | Aug 11, 2019 | ClassificationDenoising | —Unverified | 0 |
| Pixel-level Reconstruction and Classification for Noisy Handwritten Bangla Characters | Jun 21, 2018 | ClassificationDocument Image Classification | —Unverified | 0 |
| A Multi-Modal Multilingual Benchmark for Document Image Classification | Oct 25, 2023 | ClassificationCross-Lingual Transfer | —Unverified | 0 |
| Self-Supervised Representation Learning on Document Images | Apr 18, 2020 | Classificationdocument-image-classification | —Unverified | 0 |
| Toward Automatic Interpretation of 3D Plots | Jun 14, 2021 | Document Image ClassificationShape from Texture | —Unverified | 0 |
| GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification | Sep 11, 2023 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Visual and Textual Deep Feature Fusion for Document Image Classification | Jun 16, 2020 | Classificationdocument-image-classification | —Unverified | 0 |
| VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification | May 24, 2022 | Document ClassificationDocument Image Classification | —Unverified | 0 |
| Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models | Dec 18, 2024 | Document Classificationdocument-image-classification | —Unverified | 0 |
| Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image Classification | Apr 11, 2017 | document-image-classificationDocument Image Classification | CodeCode Available | 0 |
| Automatic Recognition of Learning Resource Category in a Digital Library | Nov 28, 2023 | document-image-classificationDocument Image Classification | CodeCode Available | 0 |
| LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding | Apr 18, 2021 | Document Image Classificationdocument understanding | CodeCode Available | 0 |
| Light-Weighted CNN for Text Classification | Apr 16, 2020 | ClassificationDocument Classification | CodeCode Available | 0 |
| LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking | Apr 18, 2022 | cross-modal alignmentDocument AI | CodeCode Available | 0 |
| Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting | May 21, 2024 | document-image-classificationDocument Image Classification | CodeCode Available | 0 |
| LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding | Dec 29, 2020 | Document Image ClassificationDocument Layout Analysis | CodeCode Available | 0 |
| StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training | Mar 1, 2023 | Document Image Classificationimage-classification | CodeCode Available | 0 |
| StructuralLM: Structural Pre-training for Form Understanding | May 24, 2021 | document-image-classificationDocument Image Classification | CodeCode Available | 0 |
| SUT: a new multi-purpose synthetic dataset for Farsi document image analysis | Nov 27, 2023 | Document Classificationdocument-image-classification | CodeCode Available | 0 |
| Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks | Jan 29, 2018 | document-image-classificationDocument Image Classification | CodeCode Available | 0 |