LMV-RPA: Large Model Voting-based Robotic Process Automation Dec 23, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations Jul 1, 2021 Key Information Extraction Optical Character Recognition (OCR)
Code Code Available 0An Open Source Contractual Language Understanding Application Using Machine Learning Jun 1, 2022 Document Text Classification Information Retrieval
Code Code Available 0Document Image Cleaning using Budget-Aware Black-Box Approximation Jun 22, 2023 Optical Character Recognition (OCR)
Code Code Available 0iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition Jun 27, 2022 Face Detection Face Recognition
Code Code Available 0Alleviating Digitization Errors in Named Entity Recognition for Historical Documents Nov 1, 2020 named-entity-recognition Named Entity Recognition
Code Code Available 0An OCR system for the Unified Northern Alphabet Jan 1, 2019 Optical Character Recognition (OCR)
Code Code Available 0Parallel Iterative Edit Models for Local Sequence Transduction Oct 7, 2019 Decoder Grammatical Error Correction
Code Code Available 0CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models Aug 30, 2024 Articles named-entity-recognition
Code Code Available 0PDFAnno: a Web-based Linguistic Annotation Tool for PDF Documents May 1, 2018 Coreference Resolution Optical Character Recognition (OCR)
Code Code Available 0Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts Oct 22, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Arrow-Guided VLM: Enhancing Flowchart Understanding via Arrow Direction Encoding May 9, 2025 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation Jun 12, 2024 Document Level Machine Translation Document Translation
Code Code Available 0A Data-driven Investigation of Euphemistic Language: Comparing the usage of "slave" and "servant" in 19th century US newspapers Mar 19, 2025 Optical Character Recognition (OCR)
Code Code Available 0DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation Jun 25, 2024 Computational Efficiency Optical Character Recognition (OCR)
Code Code Available 0Historical Ink: 19th Century Latin American Spanish Newspaper Corpus with LLM OCR Correction Jul 4, 2024 Language Modeling Language Modelling
Code Code Available 0Adapting the Tesseract Open Source OCR Engine for Multilingual OCR Jul 25, 2009 Optical Character Recognition (OCR)
Code Code Available 0Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math Textbooks Jul 30, 2023 Math Optical Character Recognition
Code Code Available 0High-Throughput Phenotyping using Computer Vision and Machine Learning Jul 8, 2024 Image Segmentation Optical Character Recognition
Code Code Available 0HENet: Forcing a Network to Think More for Font Recognition Oct 21, 2021 Font Recognition Optical Character Recognition (OCR)
Code Code Available 0DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness Nov 29, 2024 Optical Character Recognition (OCR) Question Answering
Code Code Available 0PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network Apr 12, 2021 Decoder Optical Character Recognition (OCR)
Code Code Available 0PHD: Pixel-Based Language Modeling of Historical Documents Oct 22, 2023 Language Modeling Language Modelling
Code Code Available 0MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features Sep 25, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Network Jun 12, 2019 Optical Character Recognition (OCR) Text Segmentation
Code Code Available 0DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks Oct 1, 2019 3D geometry Local Distortion
Code Code Available 0Single Classifier-based Passive System for Source Printer Classification using Local Texture Features Jun 22, 2017 General Classification Optical Character Recognition (OCR)
Code Code Available 0Measuring Intersectional Biases in Historical Documents May 21, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models Feb 25, 2025 Optical Character Recognition (OCR)
Code Code Available 0Handwritten Code Recognition for Pen-and-Paper CS Education Aug 7, 2024 Hallucination Language Modeling
Code Code Available 0PIXELMOD: Improving Soft Moderation of Visual Misleading Information on Twitter Jul 30, 2024 Misinformation Optical Character Recognition
Code Code Available 0An Evaluation of OCR on Egocentric Data Jun 11, 2022 Optical Character Recognition (OCR)
Code Code Available 0Attention-based Extraction of Structured Information from Street View Imagery Apr 11, 2017 Optical Character Recognition (OCR)
Code Code Available 0An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers Apr 15, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Chinese Text in the Wild Feb 28, 2018 Optical Character Recognition (OCR)
Code Code Available 0Handwriting Classification for the Analysis of Art-Historical Documents Nov 4, 2020 Classification General Classification
Code Code Available 0MIDV-2019: Challenges of the modern mobile-based document OCR Oct 9, 2019 Face Detection Optical Character Recognition (OCR)
Code Code Available 0DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment Jul 17, 2025 Document Image Quality Assessment Image Quality Assessment
Code Code Available 0Aligned Music Notation and Lyrics Transcription Dec 5, 2024 Language Modeling Language Modelling
Code Code Available 0Analyzing Green View Index and Green View Index best path using Google Street View and deep learning Apr 26, 2021 Optical Character Recognition (OCR) Semantic Segmentation
Code Code Available 0PopEval: A Character-Level Approach to End-To-End Evaluation Compatible with Word-Level Benchmark Dataset Aug 29, 2019 Optical Character Recognition (OCR)
Code Code Available 0Mining Spatio-temporal Data on Industrialization from Historical Registries Dec 3, 2016 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents Apr 30, 2024 8k Diversity
Code Code Available 0Post-OCR parsing: building simple and robust parser via BIO tagging Sep 14, 2019 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Post-OCR Text Correction for Bulgarian Historical Documents Aug 31, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0An efficient way for segmentation of Bangla characters in printed document using curved scanning May 13, 2016 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images Oct 15, 2019 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts Dec 20, 2024 Benchmarking Optical Character Recognition
Code Code Available 0When Vision Fails: Text Attacks Against ViT and OCR Jun 12, 2023 Optical Character Recognition (OCR)
Code Code Available 0Predicting the Past: Estimating Historical Appraisals with OCR and Machine Learning May 30, 2025 Optical Character Recognition (OCR)
Code Code Available 0