| OCR-free Document Understanding Transformer | Nov 30, 2021 | Document Image Classification, document understanding | Code Available | 3 | 5 |
| LayoutLM: Pre-training of Text and Layout for Document Image Understanding | Dec 31, 2019 | Document AI, document-image-classification | Code Available | 2 | 5 |
| LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding | Feb 28, 2022 | Document Image Classification, document understanding | Code Available | 2 | 5 |
| BEiT: BERT Pre-Training of Image Transformers | Jun 15, 2021 | Document Image Classification, Document Layout Analysis | Code Available | 2 | 5 |
| Improving accuracy and speeding up Document Image Classification through parallel systems | Jun 16, 2020 | Document Classification, document-image-classification | Code Available | 1 | 5 |
| DiT: Self-supervised Pre-training for Document Image Transformer | Mar 4, 2022 | Document AI, document-image-classification | Code Available | 1 | 5 |
| Multimodal Side-Tuning for Document Classification | Jan 16, 2023 | Classification, Document Classification | Code Available | 1 | 5 |
| DocFormer: End-to-End Transformer for Document Understanding | Jun 22, 2021 | Document Image Classification, document understanding | Code Available | 1 | 5 |
| Revisiting ResNets: Improved Training and Scaling Strategies | Mar 13, 2021 | Action Classification, Document Image Classification | Code Available | 1 | 5 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Jul 26, 2019 | Common Sense Reasoning, Document Image Classification | Code Available | 1 | 5 |
| ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding | Oct 12, 2022 | document-image-classification, Document Image Classification | Code Available | 1 | 5 |
| Training data-efficient image transformers & distillation through attention | Dec 23, 2020 | Document Image Classification, Document Layout Analysis | Code Available | 1 | 5 |
| DocXClassifier: High Performance Explainable Deep Network for Document Image Classification | Mar 17, 2022 | Classification, Data Augmentation | Code Available | 1 | 5 |
| Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer | Feb 18, 2021 | Decoder, Document Image Classification | Code Available | 1 | 5 |
| Automatic Recognition of Learning Resource Category in a Digital Library | Nov 28, 2023 | document-image-classification, Document Image Classification | Code Available | 0 | 5 |
| Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image Classification | Apr 11, 2017 | document-image-classification, Document Image Classification | Code Available | 0 | 5 |
| StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training | Mar 1, 2023 | Document Image Classification, image-classification | Code Available | 0 | 5 |
| Light-Weighted CNN for Text Classification | Apr 16, 2020 | Classification, Document Classification | Code Available | 0 | 5 |
| LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding | Apr 18, 2021 | Document Image Classification, document understanding | Code Available | 0 | 5 |
| Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting | May 21, 2024 | document-image-classification, Document Image Classification | Code Available | 0 | 5 |
| StructuralLM: Structural Pre-training for Form Understanding | May 24, 2021 | document-image-classification, Document Image Classification | Code Available | 0 | 5 |
| LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking | Apr 18, 2022 | cross-modal alignment, Document AI | Code Available | 0 | 5 |
| SUT: a new multi-purpose synthetic dataset for Farsi document image analysis | Nov 27, 2023 | Document Classification, document-image-classification | Code Available | 0 | 5 |
| LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding | Dec 29, 2020 | Document Image Classification, Document Layout Analysis | Code Available | 0 | 5 |
| Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks | Jan 29, 2018 | document-image-classification, Document Image Classification | Code Available | 0 | 5 |