| Calibrated Structured Prediction | Dec 1, 2015 | Medical Diagnosis, Optical Character Recognition | Code Available | 0 | 5 |
| Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition | Jul 5, 2018 | GPU, Optical Character Recognition | Code Available | 0 | 5 |
| Answering Questions about Data Visualizations using Efficient Bimodal Fusion | Aug 5, 2019 | Chart Question Answering, Optical Character Recognition | Code Available | 0 | 5 |
| LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition | May 23, 2022 | Handwriting Recognition, Knowledge Distillation | Code Available | 0 | 5 |
| Aligned Music Notation and Lyrics Transcription | Dec 5, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| LMV-RPA: Large Model Voting-based Robotic Process Automation | Dec 23, 2024 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| License Plate Detection and Recognition in Unconstrained Scenarios | Sep 1, 2018 | License Plate Detection, License Plate Recognition | Code Available | 0 | 5 |
| Brno Mobile OCR Dataset | Jul 2, 2019 | Binarization, Denoising | Code Available | 0 | 5 |
| Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription | Feb 27, 2025 | Handwritten Text Recognition, HTR | Code Available | 0 | 5 |
| M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation | Jun 12, 2024 | Document Level Machine Translation, Document Translation | Code Available | 0 | 5 |
| IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition | Dec 2, 2023 | Optical Character Recognition, Printed Text Recognition | Code Available | 0 | 5 |
| Binary Document Image Super Resolution for Improved Readability and OCR Performance | Dec 6, 2018 | Image Super-Resolution, Information Retrieval | Code Available | 0 | 5 |
| iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition | Jun 27, 2022 | Face Detection, Face Recognition | Code Available | 0 | 5 |
| Handwritten Code Recognition for Pen-and-Paper CS Education | Aug 7, 2024 | Hallucination, Language Modeling | Code Available | 0 | 5 |
| High-Throughput Phenotyping using Computer Vision and Machine Learning | Jul 8, 2024 | Image Segmentation, Optical Character Recognition | Code Available | 0 | 5 |
| Enhancing Cross-task Transferability of Adversarial Examples with Dispersion Reduction | May 8, 2019 | image-classification, Image Classification | Code Available | 0 | 5 |
| It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports | Jan 22, 2021 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Sep 25, 2024 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition | Nov 15, 2018 | Marketing, Optical Character Recognition | Code Available | 0 | 5 |
| An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers | Apr 15, 2020 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| Gated Recurrent Convolution Neural Network for OCR | Dec 1, 2017 | General Classification, image-classification | Code Available | 0 | 5 |
| From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction | Oct 12, 2019 | BIG-bench Machine Learning, Machine Translation | Code Available | 0 | 5 |
| FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting | Aug 27, 2024 | Benchmarking, Decoder | Code Available | 0 | 5 |
| Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models | Apr 16, 2025 | document understanding, Layout Design | Code Available | 0 | 5 |
| DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images | Oct 15, 2019 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |