| Toxicity of the Commons: Curating Open-Source Pre-Training Data | Oct 29, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal Domain | Jun 3, 2023 | BenchmarkingDecoder | CodeCode Available | 1 |
| Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition | May 29, 2025 | Handwritten Mathmatical Expression RecognitionLanguage Modeling | CodeCode Available | 1 |
| Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character Recognition | Aug 4, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images | Apr 29, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition | Oct 7, 2021 | Label Error DetectionOptical Character Recognition | CodeCode Available | 1 |
| Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining | Jul 15, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Hespi: A pipeline for automatically detecting information from hebarium specimen sheets | Oct 11, 2024 | Handwritten Text RecognitionHTR | CodeCode Available | 1 |
| bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents | Aug 21, 2023 | distortion correctionOptical Character Recognition | CodeCode Available | 1 |
| Confidence-aware Non-repetitive Multimodal Transformers for TextCaps | Dec 7, 2020 | Image CaptioningOptical Character Recognition | CodeCode Available | 1 |
| Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter | Jun 10, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval | Aug 1, 2024 | AttributeOptical Character Recognition | CodeCode Available | 1 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Oct 25, 2023 | Handwritten Text RecognitionKey Information Extraction | CodeCode Available | 1 |
| GenKIE: Robust Generative Multimodal Document Key Information Extraction | Oct 24, 2023 | DecoderKey Information Extraction | CodeCode Available | 1 |
| Iranis: A Large-scale Dataset of Farsi License Plate Characters | Jan 1, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| An Empirical Study of Scaling Law for OCR | Dec 29, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| A Two-Step Approach for Automatic OCR Post-Correction | Dec 1, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents | Apr 24, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Efficient OCR for Building a Diverse Digital History | Apr 5, 2023 | DiversityImage Retrieval | CodeCode Available | 1 |
| Detection of Furigana Text in Images | Jul 8, 2022 | object-detectionObject Detection | CodeCode Available | 1 |
| A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition | Dec 27, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Exploring Better Text Image Translation with Multimodal Codebook | May 27, 2023 | Machine TranslationOptical Character Recognition | CodeCode Available | 1 |
| BankNote-Net: Open dataset for assistive universal currency recognition | Apr 7, 2022 | Contrastive LearningFew-Shot Learning | CodeCode Available | 1 |
| FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems | Dec 15, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Digitizing Historical Balance Sheet Data: A Practitioner's Guide | Mar 31, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Geometry Restoration and Dewarping of Camera-Captured Document Images | Jan 6, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Data Generation for Post-OCR correction of Cyrillic handwriting | Nov 27, 2023 | Handwriting generationHandwritten Text Recognition | CodeCode Available | 1 |
| hmBERT: Historical Multilingual Language Models for Named Entity Recognition | May 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset | Jun 6, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Fully Unsupervised Diversity Denoising with Convolutional Variational Autoencoders | Jun 10, 2020 | Cell SegmentationDenoising | CodeCode Available | 1 |
| Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents | Aug 6, 2021 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 1 |
| LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images? | May 18, 2025 | Logical ReasoningMultimodal Reasoning | CodeCode Available | 1 |
| Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification | Feb 16, 2023 | Few-Shot Image ClassificationFew-Shot Learning | CodeCode Available | 1 |
| A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes | Oct 1, 2020 | Multi-Label ClassificationOptical Character Recognition | CodeCode Available | 1 |
| Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven Approach | Aug 27, 2024 | License Plate RecognitionOptical Character Recognition | CodeCode Available | 1 |
| Boosting on the shoulders of giants in quantum device calibration | May 13, 2020 | BIG-bench Machine LearningFew-Shot Learning | CodeCode Available | 1 |
| Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text Images | Nov 18, 2022 | DeblurringImage Deblurring | CodeCode Available | 1 |
| RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering | Oct 24, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends | Jul 14, 2025 | document understandingOptical Character Recognition | —Unverified | 0 |
| A survey of modern optical character recognition techniques | Dec 13, 2014 | Image EnhancementOptical Character Recognition | —Unverified | 0 |
| Ancient but Digitized: Developing Handwritten Optical Character Recognition for East Syriac Script Through Creating KHAMIS Dataset | Aug 24, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| A Study of Sindhi Related and Arabic Script Adapted languages Recognition | Dec 13, 2014 | ArticlesOptical Character Recognition | —Unverified | 0 |
| Advancing Visual Specification of Code Requirements for Graphs | Jul 29, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Advancing Vehicle Plate Recognition: Multitasking Visual Language Models with VehiclePaliGemma | Dec 14, 2024 | GPULicense Plate Recognition | —Unverified | 0 |
| An Assessment of the Impact of OCR Noise on Language Models | Jan 26, 2022 | Language ModellingOptical Character Recognition | —Unverified | 0 |
| Abstractive Information Extraction from Scanned Invoices (AIESI) using End-to-end Sequential Approach | Sep 12, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Artificial neural networks and fuzzy logic for recognizing alphabet characters and mathematical symbols | Jul 6, 2016 | Image SegmentationOptical Character Recognition | —Unverified | 0 |
| Artificial Eye for the Blind | Jul 7, 2023 | Objectobject-detection | —Unverified | 0 |
| An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW | Jun 18, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Confidence-Aware Document OCR Error Detection | Sep 6, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |