| Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification | Feb 16, 2023 | Few-Shot Image ClassificationFew-Shot Learning | CodeCode Available | 1 |
| A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition | Dec 27, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels | Dec 5, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text Images | Nov 18, 2022 | DeblurringImage Deblurring | CodeCode Available | 1 |
| MCSCSet: A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction | Oct 21, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| You Actually Look Twice At it (YALTAi): using an object detection approach instead of region segmentation within the Kraken engine | Jul 19, 2022 | Classificationobject-detection | CodeCode Available | 1 |
| Detection of Furigana Text in Images | Jul 8, 2022 | object-detectionObject Detection | CodeCode Available | 1 |
| hmBERT: Historical Multilingual Language Models for Named Entity Recognition | May 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| BankNote-Net: Open dataset for assistive universal currency recognition | Apr 7, 2022 | Contrastive LearningFew-Shot Learning | CodeCode Available | 1 |
| Digitizing Historical Balance Sheet Data: A Practitioner's Guide | Mar 31, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| On the Cross-dataset Generalization in License Plate Recognition | Jan 2, 2022 | Data AugmentationLicense Plate Detection | CodeCode Available | 1 |
| An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images | Dec 3, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Lexically Aware Semi-Supervised Learning for OCR Post-Correction | Nov 4, 2021 | Language ModellingOptical Character Recognition | CodeCode Available | 1 |
| WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition | Oct 7, 2021 | Label Error DetectionOptical Character Recognition | CodeCode Available | 1 |
| TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models | Sep 21, 2021 | Handwritten Text RecognitionLanguage Modeling | CodeCode Available | 1 |
| Post-OCR Document Correction with large Ensembles of Character Sequence-to-Sequence Models | Sep 13, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents | Aug 6, 2021 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 1 |
| Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining | Jul 15, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter | Jun 10, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations | May 23, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Unknown-box Approximation to Improve Optical Character Recognition Performance | May 17, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model | Apr 19, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Combining Morphological and Histogram based Text Line Segmentation in the OCR Context | Mar 16, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Neural OCR Post-Hoc Correction of Historical Corpora | Feb 1, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Iranis: A Large-scale Dataset of Farsi License Plate Characters | Jan 1, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems | Dec 15, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Confidence-aware Non-repetitive Multimodal Transformers for TextCaps | Dec 7, 2020 | Image CaptioningOptical Character Recognition | CodeCode Available | 1 |
| A Two-Step Approach for Automatic OCR Post-Correction | Dec 1, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish | Nov 6, 2020 | Machine TranslationNMT | CodeCode Available | 1 |
| RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering | Oct 24, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| TLGAN: document Text Localization using Generative Adversarial Nets | Oct 22, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Table Structure Recognition using Top-Down and Bottom-Up Cues | Oct 9, 2020 | Cell DetectionOptical Character Recognition | CodeCode Available | 1 |
| A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes | Oct 1, 2020 | Multi-Label ClassificationOptical Character Recognition | CodeCode Available | 1 |
| Fully Unsupervised Diversity Denoising with Convolutional Variational Autoencoders | Jun 10, 2020 | Cell SegmentationDenoising | CodeCode Available | 1 |
| Boosting on the shoulders of giants in quantum device calibration | May 13, 2020 | BIG-bench Machine LearningFew-Shot Learning | CodeCode Available | 1 |
| PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks | Apr 16, 2020 | Graph LearningKey Information Extraction | CodeCode Available | 1 |
| ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation | Mar 23, 2020 | Domain AdaptationHandwriting generation | CodeCode Available | 1 |
| FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents | May 27, 2019 | FormOptical Character Recognition | CodeCode Available | 1 |
| Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis | Jul 15, 2025 | MarketingOptical Character Recognition | —Unverified | 0 |
| A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends | Jul 14, 2025 | document understandingOptical Character Recognition | —Unverified | 0 |
| Logios : An open source Greek Polytonic Optical Character Recognition system | Jun 26, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages | Jun 22, 2025 | image-classificationImage Classification | —Unverified | 0 |
| An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW | Jun 18, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Intelligent Automation for FDI Facilitation: Optimizing Tariff Exemption Processes with OCR And Large Language Models | Jun 12, 2025 | Large Language ModelOptical Character Recognition | —Unverified | 0 |
| Task-driven real-world super-resolution of document scans | Jun 8, 2025 | Image Super-ResolutionMulti-Task Learning | —Unverified | 0 |
| Reading in the Dark with Foveated Event Vision | Jun 7, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| SARD: A Large-Scale Synthetic Arabic OCR Dataset for Book-Style Text Recognition | May 30, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| TextSR: Diffusion Super-Resolution with Multilingual OCR Guidance | May 29, 2025 | Image Super-ResolutionOptical Character Recognition | —Unverified | 0 |
| MT^3: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning | May 26, 2025 | document understandingMachine Translation | —Unverified | 0 |
| Words as Geometric Features: Estimating Homography using Optical Character Recognition as Compressed Image Representation | May 25, 2025 | Anomaly DetectionHomography Estimation | —Unverified | 0 |