RealKIE: Five Novel Datasets for Enterprise Key Information Extraction. Mar 29, 2024. Key Information Extraction, Optical Character Recognition (OCR). Unverified.
Real-time information retrieval from Identity cards. Mar 26, 2020. Face Detection, Information Retrieval. Unverified.
Jochre 3 and the Yiddish OCR corpus. Jan 14, 2025. Optical Character Recognition (OCR). Code Available.
Combining OCR Models for Reading Early Modern Printed Books. May 11, 2023. Font Recognition, Optical Character Recognition (OCR). Code Available.
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription. Feb 27, 2025. Handwritten Text Recognition, HTR. Code Available.
Scrambled text: training Language Models to correct OCR errors using synthetic data. Sep 29, 2024. Articles, Language Modeling. Code Available.
KAP: MLLM-assisted OCR Text Enhancement for Hybrid Retrieval in Chinese Non-Narrative Documents. Mar 11, 2025. Optical Character Recognition (OCR), Retrieval. Code Available.
SEARNN: Training RNNs with Global-Local Losses. Jun 14, 2017. Machine Translation, Optical Character Recognition (OCR). Code Available.
Document Rectification and Illumination Correction using a Patch-based CNN. Sep 20, 2019. Optical Character Recognition (OCR). Code Available.
Optimal Projections for Discriminative Dictionary Learning using the JL-lemma. Aug 27, 2023. Dictionary Learning, Dimensionality Reduction. Code Available.
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images. Jan 26, 2016. Diversity, General Classification. Code Available.
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications. Mar 21, 2025. 16k, 4k. Code Available.
Clustering-Based Article Identification in Historical Newspapers. Jun 1, 2019. Articles, Clustering. Code Available.
It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports. Jan 22, 2021. Optical Character Recognition (OCR). Code Available.
Optical Character Recognition of 19th Century Classical Commentaries: the Current State of Affairs. Oct 13, 2021. Optical Character Recognition (OCR). Code Available.
A Multi-Object Rectified Attention Network for Scene Text Recognition. Jan 10, 2019. Decoder, Object. Code Available.
Teaching Machines to Code: Neural Markup Generation with Visual Attention. Feb 15, 2018. Math, Optical Character Recognition (OCR). Code Available.
An Unsupervised Model of Orthographic Variation for Historical Document Transcription. Jun 1, 2016. Optical Character Recognition (OCR). Code Available.
LAREX - A semi-automatic open-source Tool for Layout Analysis and Region Extraction on Early Printed Books. Jan 20, 2017. Optical Character Recognition (OCR). Code Available.
Toward Advancing License Plate Super-Resolution in Real-World Scenarios: A Dataset and Benchmark. May 9, 2025. License Plate Recognition, Optical Character Recognition. Code Available.
Automatic Recognition of Learning Resource Category in a Digital Library. Nov 28, 2023. Document Image Classification. Code Available.
Investigating OCR-Sensitive Neurons to Improve Entity Recognition in Historical Documents. Sep 25, 2024. Named Entity Recognition. Code Available.
Latent Tree Language Model. Nov 1, 2016. Automatic Speech Recognition (ASR), Language Modeling. Code Available.
InstructOCR: Instruction Boosting Scene Text Spotting. Dec 20, 2024. Optical Character Recognition (OCR), Text Spotting. Code Available.
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing. Jun 1, 2025. Document AI, document understanding. Code Available.
Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering. Mar 9, 2021. Optical Character Recognition (OCR), Question Answering. Code Available.
Optimization of Image Processing Algorithms for Character Recognition in Cultural Typewritten Documents. Nov 27, 2023. Optical Character Recognition (OCR). Code Available.
Optimizing Nepali PDF Extraction: A Comparative Study of Parser and OCR Technologies. Jul 5, 2024. Optical Character Recognition (OCR). Code Available.
World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering. Sep 30, 2024. Optical Character Recognition (OCR), Question Answering. Code Available.
Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning. Jul 9, 2025. Benchmarking, Image Retrieval. Code Available.
LOANet: A Lightweight Network Using Object Attention for Extracting Buildings and Roads from UAV Aerial Remote Sensing Images. Dec 16, 2022. Decoder, Optical Character Recognition (OCR). Code Available.
Order-preserving Consistency Regularization for Domain Adaptation and Generalization. Sep 23, 2023. Data Augmentation, Domain Adaptation. Code Available.
LEGAL-UQA: A Low-Resource Urdu-English Dataset for Legal Question Answering. Oct 16, 2024. Optical Character Recognition (OCR), Question Answering. Code Available.
Vehicle-Rear: A New Dataset to Explore Feature Fusion for Vehicle Identification Using Convolutional Neural Networks. Nov 13, 2019. Fine-Grained Vehicle Classification, License Plate Detection. Code Available.
Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts. Dec 15, 2019. Diversity, Instance Segmentation. Code Available.
Improving patch-based scene text script identification with ensembles of conjoined networks. Feb 24, 2016. General Classification, Optical Character Recognition (OCR). Code Available.
Levenshtein OCR. Sep 8, 2022. Imitation Learning, Optical Character Recognition (OCR). Code Available.
ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images. Apr 16, 2024. Multimodal Deep Learning, Optical Character Recognition (OCR). Code Available.
Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding. Dec 19, 2022. Contrastive Learning, document understanding. Code Available.
Improving OCR Accuracy on Early Printed Books by utilizing Cross Fold Training and Voting. Nov 27, 2017. Optical Character Recognition (OCR). Code Available.
License Plate Detection and Recognition in Unconstrained Scenarios. Sep 1, 2018. License Plate Detection, License Plate Recognition. Code Available.
Answering Questions about Data Visualizations using Efficient Bimodal Fusion. Aug 5, 2019. Chart Question Answering, Optical Character Recognition. Code Available.
Improving OCR Accuracy on Early Printed Books using Deep Convolutional Networks. Feb 27, 2018. Optical Character Recognition (OCR). Code Available.
LILA-BOTI: Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition. May 23, 2022. Handwriting Recognition, Knowledge Distillation. Code Available.
OVeNet: Offset Vector Network for Semantic Segmentation. Mar 25, 2023. Optical Character Recognition (OCR), Scene Understanding. Code Available.
A model of diffuse Galactic Radio Emission from 10 MHz to 100 GHz. Feb 12, 2008. Optical Character Recognition (OCR). Code Available.
Separate and Locate: Rethink the Text in Text-based Visual Question Answering. Aug 31, 2023. Optical Character Recognition (OCR), Position. Code Available.
Sequence-aware multimodal page classification of Brazilian legal documents. Jul 2, 2022. Classification, Management. Code Available.
Improving OCR Accuracy on Early Printed Books by combining Pretraining, Voting, and Active Learning. Feb 27, 2018. Active Learning, Optical Character Recognition (OCR). Code Available.
Implicit Language Model in LSTM for OCR. May 23, 2018. Language Modeling. Code Available.