| It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports | Jan 22, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers | Apr 15, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription | Feb 27, 2025 | Handwritten Text RecognitionHTR | CodeCode Available | 0 | 5 |
| DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents | Apr 30, 2024 | 8kDiversity | CodeCode Available | 0 | 5 |
| IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition | Dec 2, 2023 | Optical Character RecognitionPrinted Text Recognition | CodeCode Available | 0 | 5 |
| iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition | Jun 27, 2022 | Face DetectionFace Recognition | CodeCode Available | 0 | 5 |
| Handwritten Code Recognition for Pen-and-Paper CS Education | Aug 7, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 | 5 |
| GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding | May 6, 2024 | Contrastive Learningdocument understanding | CodeCode Available | 0 | 5 |
| DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images | Oct 15, 2019 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset | Mar 9, 2023 | BenchmarkingDeep Learning | CodeCode Available | 0 | 5 |
| From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition | Nov 15, 2018 | MarketingOptical Character Recognition | CodeCode Available | 0 | 5 |
| Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts | Dec 20, 2024 | BenchmarkingOptical Character Recognition | CodeCode Available | 0 | 5 |
| From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction | Oct 12, 2019 | BIG-bench Machine LearningMachine Translation | CodeCode Available | 0 | 5 |
| Gated Recurrent Convolution Neural Network for OCR | Dec 1, 2017 | General Classificationimage-classification | CodeCode Available | 0 | 5 |
| High-Throughput Phenotyping using Computer Vision and Machine Learning | Jul 8, 2024 | Image SegmentationOptical Character Recognition | CodeCode Available | 0 | 5 |
| DDI-100: Dataset for Text Detection and Recognition | Dec 25, 2019 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting | Aug 27, 2024 | BenchmarkingDecoder | CodeCode Available | 0 | 5 |
| A Gaussian Process Upsampling Model for Improvements in Optical Character Recognition | May 7, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Mining Spatio-temporal Data on Industrialization from Historical Registries | Dec 3, 2016 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study | Dec 29, 2024 | Motion DetectionOptical Character Recognition | CodeCode Available | 0 | 5 |
| End-to-End Optical Character Recognition for Bengali Handwritten Words | May 9, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math Textbooks | Jul 30, 2023 | MathOptical Character Recognition | CodeCode Available | 0 | 5 |
| Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models | Apr 16, 2025 | document understandingLayout Design | CodeCode Available | 0 | 5 |
| DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language Archives | Nov 14, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |
| FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs | Jul 11, 2018 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 | 5 |