| DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency | Nov 9, 2023 | document understandingKey Information Extraction | —Unverified | 0 |
| On Manipulating Scene Text in the Wild with Diffusion Models | Nov 1, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge | Oct 16, 2023 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| Towards reducing hallucination in extracting information from financial reports using Large Language Models | Oct 16, 2023 | HallucinationOptical Character Recognition | —Unverified | 0 |
| Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA | Oct 13, 2023 | Graph LearningObject | —Unverified | 0 |
| Invisible Threats: Backdoor Attack in OCR Systems | Oct 12, 2023 | Backdoor AttackOptical Character Recognition | —Unverified | 0 |
| Comprehensive Overview of Named Entity Recognition: Models, Domain-Specific Applications and Challenges | Sep 25, 2023 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Handwritten image augmentation | Aug 26, 2023 | Data AugmentationImage Augmentation | —Unverified | 0 |
| Bengali Document Layout Analysis with Detectron2 | Aug 26, 2023 | Data AugmentationDocument Layout Analysis | —Unverified | 0 |
| DISGO: Automatic End-to-End Evaluation for Scene Text OCR | Aug 25, 2023 | Machine TranslationOptical Character Recognition | —Unverified | 0 |