| Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification | Feb 8, 2024 | CAPTCHA DetectionClassification | —Unverified | 0 |
| ExTTNet: A Deep Learning Algorithm for Extracting Table Texts from Invoice Images | Feb 3, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| From Training-Free to Adaptive: Empirical Insights into MLLMs' Understanding of Detection Information | Jan 31, 2024 | Hallucinationobject-detection | —Unverified | 0 |
| Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters | Jan 1, 2024 | Multi-Task LearningOptical Character Recognition | CodeCode Available | 0 |
| An Empirical Study of Scaling Law for OCR | Dec 29, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Chaurah: A Smart Raspberry Pi based Parking System | Dec 28, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images | Dec 20, 2023 | Optical Character RecognitionSegmentation | —Unverified | 0 |
| Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey | Dec 19, 2023 | ArticlesOptical Character Recognition | —Unverified | 0 |
| memorAIs: an Optical Character Recognition and Rule-Based Medication Intake Reminder-Generating Solution | Dec 11, 2023 | FrictionOptical Character Recognition | CodeCode Available | 0 |
| UPOCR: Towards Unified Pixel-Level OCR Interface | Dec 5, 2023 | DecoderOptical Character Recognition | —Unverified | 0 |
| IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition | Dec 2, 2023 | Optical Character RecognitionPrinted Text Recognition | CodeCode Available | 0 |
| Pipeline Enabling Zero-shot Classification for Bangla Handwritten Grapheme | Dec 1, 2023 | Bangla Text DetectionClassification | —Unverified | 0 |
| Vulnerability Analysis of Transformer-based Optical Character Recognition to Adversarial Attacks | Nov 28, 2023 | Adversarial AttackOptical Character Recognition | —Unverified | 0 |
| Optimization of Image Processing Algorithms for Character Recognition in Cultural Typewritten Documents | Nov 27, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Data Generation for Post-OCR correction of Cyrillic handwriting | Nov 27, 2023 | Handwriting generationHandwritten Text Recognition | CodeCode Available | 1 |
| ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing | Nov 20, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Efficient End-to-End Visual Document Understanding with Rationale Distillation | Nov 16, 2023 | document understandingImage to text | —Unverified | 0 |
| DECDM: Document Enhancement using Cycle-Consistent Diffusion Models | Nov 16, 2023 | Data AugmentationDenoising | —Unverified | 0 |
| Reading Between the Mud: A Challenging Motorcycle Racer Number Dataset | Nov 14, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency | Nov 9, 2023 | document understandingKey Information Extraction | —Unverified | 0 |
| On Manipulating Scene Text in the Wild with Diffusion Models | Nov 1, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Oct 25, 2023 | Handwritten Text RecognitionKey Information Extraction | CodeCode Available | 1 |
| GenKIE: Robust Generative Multimodal Document Key Information Extraction | Oct 24, 2023 | DecoderKey Information Extraction | CodeCode Available | 1 |
| Towards reducing hallucination in extracting information from financial reports using Large Language Models | Oct 16, 2023 | HallucinationOptical Character Recognition | —Unverified | 0 |
| EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge | Oct 16, 2023 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA | Oct 13, 2023 | Graph LearningObject | —Unverified | 0 |
| Invisible Threats: Backdoor Attack in OCR Systems | Oct 12, 2023 | Backdoor AttackOptical Character Recognition | —Unverified | 0 |
| Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks | Oct 8, 2023 | BinarizationCPU | CodeCode Available | 1 |
| Comprehensive Overview of Named Entity Recognition: Models, Domain-Specific Applications and Challenges | Sep 25, 2023 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| DTrOCR: Decoder-only Transformer for Optical Character Recognition | Aug 30, 2023 | DecoderHandwritten Text Recognition | CodeCode Available | 2 |
| Handwritten image augmentation | Aug 26, 2023 | Data AugmentationImage Augmentation | —Unverified | 0 |
| Bengali Document Layout Analysis with Detectron2 | Aug 26, 2023 | Data AugmentationDocument Layout Analysis | —Unverified | 0 |
| Nougat: Neural Optical Understanding for Academic Documents | Aug 25, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 5 |
| DISGO: Automatic End-to-End Evaluation for Scene Text OCR | Aug 25, 2023 | Machine TranslationOptical Character Recognition | —Unverified | 0 |
| bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents | Aug 21, 2023 | distortion correctionOptical Character Recognition | CodeCode Available | 1 |
| Extraction of Text from Optic Nerve Optical Coherence Tomography Reports | Aug 21, 2023 | Optical Character Recognition | —Unverified | 0 |
| OCR Language Models with Custom Vocabularies | Aug 18, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Multimodal Analysis Of Google Bard And GPT-Vision: Experiments In Visual Reasoning | Aug 17, 2023 | Common Sense ReasoningOptical Character Recognition | —Unverified | 0 |
| OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Aug 8, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character Recognition | Aug 4, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| CTP-Net: Character Texture Perception Network for Document Image Forgery Localization | Aug 4, 2023 | Image ForensicsOptical Character Recognition | —Unverified | 0 |
| Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations | Aug 1, 2023 | DenoisingImage Denoising | —Unverified | 0 |
| Optimizing the Neural Network Training for OCR Error Correction of Historical Hebrew Texts | Jul 30, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Toward a Period-Specific Optimized Neural Network for OCR Error Correction of Historical Hebrew Texts | Jul 30, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math Textbooks | Jul 30, 2023 | MathOptical Character Recognition | CodeCode Available | 0 |
| Validation of a Zero-Shot Learning Natural Language Processing Tool for Data Abstraction from Unstructured Healthcare Data | Jul 23, 2023 | Optical Character RecognitionZero-Shot Learning | CodeCode Available | 1 |
| Handwritten and Printed Text Segmentation: A Signature Case Study | Jul 15, 2023 | Binary ClassificationOptical Character Recognition | —Unverified | 0 |
| Handwritten Text Recognition Using Convolutional Neural Network | Jul 11, 2023 | Handwritten Text RecognitionOptical Character Recognition | —Unverified | 0 |
| A Novel Pipeline for Improving Optical Character Recognition through Post-processing Using Natural Language Processing | Jul 9, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Artificial Eye for the Blind | Jul 7, 2023 | Objectobject-detection | —Unverified | 0 |