Measuring Intersectional Biases in Historical Documents May 21, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation Jun 12, 2024 Document Level Machine Translation Document Translation
Code Code Available 05 MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features Sep 25, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 MIDV-2019: Challenges of the modern mobile-based document OCR Oct 9, 2019 Face Detection Optical Character Recognition (OCR)
Code Code Available 05 LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition May 23, 2022 Handwriting Recognition Knowledge Distillation
Code Code Available 05 License Plate Detection and Recognition in Unconstrained Scenarios Sep 1, 2018 License Plate Detection License Plate Recognition
Code Code Available 05 Levenshtein OCR Sep 8, 2022 Imitation Learning Optical Character Recognition (OCR)
Code Code Available 05 LOANet: A Lightweight Network Using Object Attention for Extracting Buildings and Roads from UAV Aerial Remote Sensing Images Dec 16, 2022 Decoder Optical Character Recognition (OCR)
Code Code Available 05 Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering Mar 14, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 LEGAL-UQA: A Low-Resource Urdu-English Dataset for Legal Question Answering Oct 16, 2024 Optical Character Recognition (OCR) Question Answering
Code Code Available 05 Latent Tree Language Model Nov 1, 2016 Automatic Speech Recognition (ASR) Language Modeling
Code Code Available 05 DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks Oct 1, 2019 3D geometry Local Distortion
Code Code Available 05 LAREX - A semi-automatic open-source Tool for Layout Analysis and Region Extraction on Early Printed Books Jan 20, 2017 Optical Character Recognition (OCR)
Code Code Available 05 KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications Mar 21, 2025 16k 4k
Code Code Available 05 Jochre 3 and the Yiddish OCR corpus Jan 14, 2025 Optical Character Recognition (OCR)
Code Code Available 05 Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline May 16, 2025 Abstractive Text Summarization Language Modeling
Code Code Available 05 Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription Feb 27, 2025 Handwritten Text Recognition HTR
Code Code Available 05 LMV-RPA: Large Model Voting-based Robotic Process Automation Dec 23, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Investigating OCR-Sensitive Neurons to Improve Entity Recognition in Historical Documents Sep 25, 2024 named-entity-recognition Named Entity Recognition
Code Code Available 05 Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models Feb 25, 2025 Optical Character Recognition (OCR)
Code Code Available 05 InstructOCR: Instruction Boosting Scene Text Spotting Dec 20, 2024 Optical Character Recognition (OCR) Text Spotting
Code Code Available 05 Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts Dec 15, 2019 Diversity Instance Segmentation
Code Code Available 05 Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer Learning Mar 18, 2024 Handwritten Digit Recognition Optical Character Recognition
Code Code Available 05 Improving patch-based scene text script identification with ensembles of conjoined networks Feb 24, 2016 General Classification Optical Character Recognition (OCR)
Code Code Available 05 Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing Jun 1, 2025 Document AI document understanding
Code Code Available 05 It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports Jan 22, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment Jul 17, 2025 Document Image Quality Assessment Image Quality Assessment
Code Code Available 05 Improving OCR Accuracy on Early Printed Books by combining Pretraining, Voting, and Active Learning Feb 27, 2018 Active Learning Optical Character Recognition (OCR)
Code Code Available 05 BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset Mar 9, 2023 Benchmarking Deep Learning
Code Code Available 05 DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents Apr 30, 2024 8k Diversity
Code Code Available 05 An efficient way for segmentation of Bangla characters in printed document using curved scanning May 13, 2016 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Improving OCR Accuracy on Early Printed Books using Deep Convolutional Networks Feb 27, 2018 Optical Character Recognition (OCR)
Code Code Available 05 Automatic Recognition of Learning Resource Category in a Digital Library Nov 28, 2023 document-image-classification Document Image Classification
Code Code Available 05 Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations Jul 1, 2021 Key Information Extraction Optical Character Recognition (OCR)
Code Code Available 05 Improving OCR Accuracy on Early Printed Books by utilizing Cross Fold Training and Voting Nov 27, 2017 Optical Character Recognition (OCR)
Code Code Available 05 Optimal Projections for Discriminative Dictionary Learning using the JL-lemma Aug 27, 2023 Dictionary Learning Dimensionality Reduction
Code Code Available 05 KAP: MLLM-assisted OCR Text Enhancement for Hybrid Retrieval in Chinese Non-Narrative Documents Mar 11, 2025 Optical Character Recognition (OCR) Retrieval
Code Code Available 05 iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition Jun 27, 2022 Face Detection Face Recognition
Code Code Available 05 High-Throughput Phenotyping using Computer Vision and Machine Learning Jul 8, 2024 Image Segmentation Optical Character Recognition
Code Code Available 05 Historical Ink: 19th Century Latin American Spanish Newspaper Corpus with LLM OCR Correction Jul 4, 2024 Language Modeling Language Modelling
Code Code Available 05 Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Network Jun 12, 2019 Optical Character Recognition (OCR) Text Segmentation
Code Code Available 05 DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images Oct 15, 2019 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Handwriting Classification for the Analysis of Art-Historical Documents Nov 4, 2020 Classification General Classification
Code Code Available 05 An Efficient and Layout-Independent Automatic License Plate Recognition System Based on the YOLO detector Sep 4, 2019 Data Augmentation GPU
Code Code Available 05 Handwritten Code Recognition for Pen-and-Paper CS Education Aug 7, 2024 Hallucination Language Modeling
Code Code Available 05 HENet: Forcing a Network to Think More for Font Recognition Oct 21, 2021 Font Recognition Optical Character Recognition (OCR)
Code Code Available 05 Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts Dec 20, 2024 Benchmarking Optical Character Recognition
Code Code Available 05 DDI-100: Dataset for Text Detection and Recognition Dec 25, 2019 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 DCQA: Document-Level Chart Question Answering towards Complex Reasoning and Common-Sense Understanding Oct 29, 2023 Answer Generation Chart Question Answering
Code Code Available 05 Analyzing Green View Index and Green View Index best path using Google Street View and deep learning Apr 26, 2021 Optical Character Recognition (OCR) Semantic Segmentation
Code Code Available 05