| CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset | Jun 6, 2024 | Object Detection | Code Available | 1 |
| Lexically Aware Semi-Supervised Learning for OCR Post-Correction | Nov 4, 2021 | Language Modelling, Optical Character Recognition | Code Available | 1 |
| MCSCSet: A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction | Oct 21, 2022 | Optical Character Recognition (OCR) | Code Available | 1 |
| Detection of Furigana Text in Images | Jul 8, 2022 | Object Detection | Code Available | 1 |
| A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes | Oct 1, 2020 | Multi-Label Classification, Optical Character Recognition | Code Available | 1 |
| Boosting on the shoulders of giants in quantum device calibration | May 13, 2020 | BIG-bench Machine Learning, Few-Shot Learning | Code Available | 1 |
| Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter | Jun 10, 2021 | Optical Character Recognition (OCR) | Code Available | 1 |
| PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks | Apr 16, 2020 | Graph Learning, Key Information Extraction | Code Available | 1 |
| A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends | Jul 14, 2025 | Document Understanding, Optical Character Recognition | Unverified | 0 |
| A survey of modern optical character recognition techniques | Dec 13, 2014 | Image Enhancement, Optical Character Recognition | Unverified | 0 |