| OCR-free Document Understanding Transformer | Nov 30, 2021 | Document Image Classificationdocument understanding | CodeCode Available | 3 |
| LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding | Feb 28, 2022 | Document Image Classificationdocument understanding | CodeCode Available | 2 |
| BEiT: BERT Pre-Training of Image Transformers | Jun 15, 2021 | Document Image ClassificationDocument Layout Analysis | CodeCode Available | 2 |
| LayoutLM: Pre-training of Text and Layout for Document Image Understanding | Dec 31, 2019 | Document AIdocument-image-classification | CodeCode Available | 2 |
| Multimodal Side-Tuning for Document Classification | Jan 16, 2023 | ClassificationDocument Classification | CodeCode Available | 1 |
| ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding | Oct 12, 2022 | document-image-classificationDocument Image Classification | CodeCode Available | 1 |
| DocXClassifier: High Performance Explainable Deep Network for Document Image Classification | Mar 17, 2022 | ClassificationData Augmentation | CodeCode Available | 1 |
| DiT: Self-supervised Pre-training for Document Image Transformer | Mar 4, 2022 | Document AIdocument-image-classification | CodeCode Available | 1 |
| DocFormer: End-to-End Transformer for Document Understanding | Jun 22, 2021 | Document Image Classificationdocument understanding | CodeCode Available | 1 |
| Revisiting ResNets: Improved Training and Scaling Strategies | Mar 13, 2021 | Action ClassificationDocument Image Classification | CodeCode Available | 1 |
| Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer | Feb 18, 2021 | DecoderDocument Image Classification | CodeCode Available | 1 |
| Training data-efficient image transformers & distillation through attention | Dec 23, 2020 | Document Image ClassificationDocument Layout Analysis | CodeCode Available | 1 |
| Improving accuracy and speeding up Document Image Classification through parallel systems | Jun 16, 2020 | Document Classificationdocument-image-classification | CodeCode Available | 1 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Jul 26, 2019 | Common Sense ReasoningDocument Image Classification | CodeCode Available | 1 |
| Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models | Dec 18, 2024 | Document Classificationdocument-image-classification | —Unverified | 0 |
| DoPTA: Improving Document Layout Analysis using Patch-Text Alignment | Dec 17, 2024 | Document AIDocument Image Classification | —Unverified | 0 |
| DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification | Jul 4, 2024 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | Jun 12, 2024 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting | May 21, 2024 | document-image-classificationDocument Image Classification | CodeCode Available | 0 |
| CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification | May 6, 2024 | Document Classificationdocument-image-classification | —Unverified | 0 |
| LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding | Mar 21, 2024 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Automatic Recognition of Learning Resource Category in a Digital Library | Nov 28, 2023 | document-image-classificationDocument Image Classification | CodeCode Available | 0 |
| SUT: a new multi-purpose synthetic dataset for Farsi document image analysis | Nov 27, 2023 | Document Classificationdocument-image-classification | CodeCode Available | 0 |
| A Multi-Modal Multilingual Benchmark for Document Image Classification | Oct 25, 2023 | ClassificationCross-Lingual Transfer | —Unverified | 0 |
| GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification | Sep 11, 2023 | document-image-classificationDocument Image Classification | —Unverified | 0 |