| Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents | Aug 6, 2021 | named-entity-recognition, Named Entity Recognition | Code Available | 1 | 5 |
| Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification | Feb 16, 2023 | Few-Shot Image Classification, Few-Shot Learning | Code Available | 1 | 5 |
| PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents | Mar 23, 2024 | Articles, Optical Character Recognition | Code Available | 1 | 5 |
| Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks | Oct 8, 2023 | Binarization, CPU | Code Available | 1 | 5 |
| hmBERT: Historical Multilingual Language Models for Named Entity Recognition | May 31, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| An Empirical Study of Scaling Law for OCR | Dec 29, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 | 5 |
| A Two-Step Approach for Automatic OCR Post-Correction | Dec 1, 2020 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 | 5 |
| bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents | Aug 21, 2023 | distortion correction, Optical Character Recognition | Code Available | 1 | 5 |
| An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images | Dec 3, 2021 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 | 5 |
| RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering | Oct 24, 2020 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 | 5 |
| Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven Approach | Aug 27, 2024 | License Plate Recognition, Optical Character Recognition | Code Available | 1 | 5 |
| GenKIE: Robust Generative Multimodal Document Key Information Extraction | Oct 24, 2023 | Decoder, Key Information Extraction | Code Available | 1 | 5 |
| Digitizing Historical Balance Sheet Data: A Practitioner's Guide | Mar 31, 2022 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 | 5 |
| Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments | Feb 10, 2025 | Benchmarking, Optical Character Recognition | Code Available | 1 | 5 |
| FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems | Dec 15, 2020 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 | 5 |
| Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval | Aug 1, 2024 | Attribute, Optical Character Recognition | Code Available | 1 | 5 |
| Geometry Restoration and Dewarping of Camera-Captured Document Images | Jan 6, 2025 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 | 5 |
| Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter | Jun 10, 2021 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 | 5 |
| Boosting on the shoulders of giants in quantum device calibration | May 13, 2020 | BIG-bench Machine Learning, Few-Shot Learning | Code Available | 1 | 5 |
| Hespi: A pipeline for automatically detecting information from hebarium specimen sheets | Oct 11, 2024 | Handwritten Text Recognition, HTR | Code Available | 1 | 5 |
| A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition | Dec 27, 2022 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 | 5 |
| Iranis: A Large-scale Dataset of Farsi License Plate Characters | Jan 1, 2021 | image-classification, Image Classification | Code Available | 1 | 5 |
| BankNote-Net: Open dataset for assistive universal currency recognition | Apr 7, 2022 | Contrastive Learning, Few-Shot Learning | Code Available | 1 | 5 |
| A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes | Oct 1, 2020 | Multi-Label Classification, Optical Character Recognition | Code Available | 1 | 5 |
| OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Aug 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |