| ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing | Nov 20, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer Learning | Mar 18, 2024 | Handwritten Digit RecognitionOptical Character Recognition | CodeCode Available | 0 | 5 |
| Noisy Parallel Data Alignment | Jan 23, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| MultiQG-TI: Towards Question Generation from Multi-modal Sources | Jul 7, 2023 | Image to textOptical Character Recognition | CodeCode Available | 0 | 5 |
| MultiOCR-QA: Dataset for Evaluating Robustness of LLMs in Question Answering on Multilingual OCR Texts | Feb 24, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| A model of diffuse Galactic Radio Emission from 10 MHz to 100 GHz | Feb 12, 2008 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| ChemGrapher: Optical Graph Recognition of Chemical Compounds by Deep Learning | Feb 23, 2020 | ArticlesDeep Learning | CodeCode Available | 0 | 5 |
| Are VLMs Really Blind | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism | Apr 29, 2024 | document understandingGPU | CodeCode Available | 0 | 5 |
| Arrow-Guided VLM: Enhancing Flowchart Understanding via Arrow Direction Encoding | May 9, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| NASS-AI: Towards Digitization of Parliamentary Bills using Document Level Embedding and Bidirectional Long Short-Term Memory | Oct 2, 2019 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| MRZ code extraction from visa and passport documents using convolutional neural networks | Sep 11, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Mining Spatio-temporal Data on Industrialization from Historical Registries | Dec 3, 2016 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Alleviating Digitization Errors in Named Entity Recognition for Historical Documents | Nov 1, 2020 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 | 5 |
| Character decomposition to resolve class imbalance problem in Hangul OCR | Aug 12, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Chandojnanam: A Sanskrit Meter Identification and Utilization System | Sep 29, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models | Aug 30, 2024 | Articlesnamed-entity-recognition | CodeCode Available | 0 | 5 |
| ASTER: An Attentional Scene Text Recognizer with Flexible Rectification | Jun 25, 2018 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| AON: Towards Arbitrarily-Oriented Text Recognition | Nov 12, 2017 | DecoderOptical Character Recognition | CodeCode Available | 0 | 5 |
| Measuring Intersectional Biases in Historical Documents | May 21, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| memorAIs: an Optical Character Recognition and Rule-Based Medication Intake Reminder-Generating Solution | Dec 11, 2023 | FrictionOptical Character Recognition | CodeCode Available | 0 | 5 |
| Multi-modal Page Stream Segmentation with Convolutional Neural Networks | Sep 27, 2019 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Comparative analysis of optical character recognition methods for Sámi texts from the National Library of Norway | Jan 13, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Object detection deep learning networks for Optical Character Recognition | May 1, 2019 | Deep LearningDocument Classification | CodeCode Available | 0 | 5 |
| PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System | Jun 7, 2022 | Data AugmentationOptical Character Recognition | CodeCode Available | 0 | 5 |
| Calibrated Structured Prediction | Dec 1, 2015 | Medical DiagnosisOptical Character Recognition | CodeCode Available | 0 | 5 |
| Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition | Jul 5, 2018 | GPUOptical Character Recognition | CodeCode Available | 0 | 5 |
| Answering Questions about Data Visualizations using Efficient Bimodal Fusion | Aug 5, 2019 | Chart Question AnsweringOptical Character Recognition | CodeCode Available | 0 | 5 |
| LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition | May 23, 2022 | Handwriting RecognitionKnowledge Distillation | CodeCode Available | 0 | 5 |
| Aligned Music Notation and Lyrics Transcription | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| LMV-RPA: Large Model Voting-based Robotic Process Automation | Dec 23, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| License Plate Detection and Recognition in Unconstrained Scenarios | Sep 1, 2018 | License Plate DetectionLicense Plate Recognition | CodeCode Available | 0 | 5 |
| Brno Mobile OCR Dataset | Jul 2, 2019 | BinarizationDenoising | CodeCode Available | 0 | 5 |
| Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription | Feb 27, 2025 | Handwritten Text RecognitionHTR | CodeCode Available | 0 | 5 |
| M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation | Jun 12, 2024 | Document Level Machine TranslationDocument Translation | CodeCode Available | 0 | 5 |
| IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition | Dec 2, 2023 | Optical Character RecognitionPrinted Text Recognition | CodeCode Available | 0 | 5 |
| Binary Document Image Super Resolution for Improved Readability and OCR Performance | Dec 6, 2018 | Image Super-ResolutionInformation Retrieval | CodeCode Available | 0 | 5 |
| iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition | Jun 27, 2022 | Face DetectionFace Recognition | CodeCode Available | 0 | 5 |
| Handwritten Code Recognition for Pen-and-Paper CS Education | Aug 7, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 | 5 |
| High-Throughput Phenotyping using Computer Vision and Machine Learning | Jul 8, 2024 | Image SegmentationOptical Character Recognition | CodeCode Available | 0 | 5 |
| Enhancing Cross-task Transferability of Adversarial Examples with Dispersion Reduction | May 8, 2019 | image-classificationImage Classification | CodeCode Available | 0 | 5 |
| It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports | Jan 22, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Sep 25, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition | Nov 15, 2018 | MarketingOptical Character Recognition | CodeCode Available | 0 | 5 |
| An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers | Apr 15, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Gated Recurrent Convolution Neural Network for OCR | Dec 1, 2017 | General Classificationimage-classification | CodeCode Available | 0 | 5 |
| From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction | Oct 12, 2019 | BIG-bench Machine LearningMachine Translation | CodeCode Available | 0 | 5 |
| FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting | Aug 27, 2024 | BenchmarkingDecoder | CodeCode Available | 0 | 5 |
| Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models | Apr 16, 2025 | document understandingLayout Design | CodeCode Available | 0 | 5 |
| DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images | Oct 15, 2019 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |