| On Manipulating Scene Text in the Wild with Diffusion Models | Nov 1, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Oct 25, 2023 | Handwritten Text RecognitionKey Information Extraction | CodeCode Available | 1 |
| GenKIE: Robust Generative Multimodal Document Key Information Extraction | Oct 24, 2023 | DecoderKey Information Extraction | CodeCode Available | 1 |
| Towards reducing hallucination in extracting information from financial reports using Large Language Models | Oct 16, 2023 | HallucinationOptical Character Recognition | —Unverified | 0 |
| EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge | Oct 16, 2023 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA | Oct 13, 2023 | Graph LearningObject | —Unverified | 0 |
| Invisible Threats: Backdoor Attack in OCR Systems | Oct 12, 2023 | Backdoor AttackOptical Character Recognition | —Unverified | 0 |
| Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks | Oct 8, 2023 | BinarizationCPU | CodeCode Available | 1 |
| Comprehensive Overview of Named Entity Recognition: Models, Domain-Specific Applications and Challenges | Sep 25, 2023 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| DTrOCR: Decoder-only Transformer for Optical Character Recognition | Aug 30, 2023 | DecoderHandwritten Text Recognition | CodeCode Available | 2 |