Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing Jun 1, 2025 Document AI document understanding
Code Code Available 05 InstructOCR: Instruction Boosting Scene Text Spotting Dec 20, 2024 Optical Character Recognition (OCR) Text Spotting
Code Code Available 05 DDI-100: Dataset for Text Detection and Recognition Dec 25, 2019 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 DCQA: Document-Level Chart Question Answering towards Complex Reasoning and Common-Sense Understanding Oct 29, 2023 Answer Generation Chart Question Answering
Code Code Available 05 Improving patch-based scene text script identification with ensembles of conjoined networks Feb 24, 2016 General Classification Optical Character Recognition (OCR)
Code Code Available 05 Data-Driven Spelling Correction using Weighted Finite-State Methods Aug 1, 2016 Optical Character Recognition (OCR) Spelling Correction
Code Code Available 05 Improving OCR Accuracy on Early Printed Books by utilizing Cross Fold Training and Voting Nov 27, 2017 Optical Character Recognition (OCR)
Code Code Available 05 Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts Dec 15, 2019 Diversity Instance Segmentation
Code Code Available 05 Investigating OCR-Sensitive Neurons to Improve Entity Recognition in Historical Documents Sep 25, 2024 named-entity-recognition Named Entity Recognition
Code Code Available 05 Data Centric Domain Adaptation for Historical Text with OCR Errors Jul 2, 2021 Cross-Domain Named Entity Recognition Domain Adaptation
Code Code Available 05 Implicit Language Model in LSTM for OCR May 23, 2018 Language Modeling Language Modelling
Code Code Available 05 Crossing Language Borders: A Pipeline for Indonesian Manhwa Translation Jan 3, 2025 Machine Translation Object Detection
Code Code Available 05 iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition Jun 27, 2022 Face Detection Face Recognition
Code Code Available 05 Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math Textbooks Jul 30, 2023 Math Optical Character Recognition
Code Code Available 05 Improving OCR Accuracy on Early Printed Books by combining Pretraining, Voting, and Active Learning Feb 27, 2018 Active Learning Optical Character Recognition (OCR)
Code Code Available 05 Attention-based Extraction of Structured Information from Street View Imagery Apr 11, 2017 Optical Character Recognition (OCR)
Code Code Available 05 High-Throughput Phenotyping using Computer Vision and Machine Learning Jul 8, 2024 Image Segmentation Optical Character Recognition
Code Code Available 05 Corpus for Coreference Resolution on Scientific Papers May 1, 2014 coreference-resolution Coreference Resolution
Code Code Available 05 Document Image Cleaning using Budget-Aware Black-Box Approximation Jun 22, 2023 Optical Character Recognition (OCR)
Code Code Available 05 A Gaussian Process Upsampling Model for Improvements in Optical Character Recognition May 7, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Enhancing Cross-task Transferability of Adversarial Examples with Dispersion Reduction May 8, 2019 image-classification Image Classification
Code Code Available 05 Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study Dec 29, 2024 Motion Detection Optical Character Recognition
Code Code Available 05 Historical Ink: 19th Century Latin American Spanish Newspaper Corpus with LLM OCR Correction Jul 4, 2024 Language Modeling Language Modelling
Code Code Available 05 An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents May 16, 2025 Form Language Modeling
Code Code Available 05 CORD: A Consolidated Receipt Dataset for Post-OCR Parsing Sep 14, 2019 Optical Character Recognition (OCR) Semantic Parsing
Code Code Available 05 Convolution-based Probability Gradient Loss for Semantic Segmentation Apr 10, 2024 Optical Character Recognition (OCR) Semantic Segmentation
Code Code Available 05 Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning Jul 9, 2025 Benchmarking Image Retrieval
Code Code Available 05 Order-preserving Consistency Regularization for Domain Adaptation and Generalization Sep 23, 2023 Data Augmentation Domain Adaptation
Code Code Available 05 Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Network Jun 12, 2019 Optical Character Recognition (OCR) Text Segmentation
Code Code Available 05 HENet: Forcing a Network to Think More for Font Recognition Oct 21, 2021 Font Recognition Optical Character Recognition (OCR)
Code Code Available 05 Improving OCR Accuracy on Early Printed Books using Deep Convolutional Networks Feb 27, 2018 Optical Character Recognition (OCR)
Code Code Available 05 DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language Archives Nov 14, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 LOANet: A Lightweight Network Using Object Attention for Extracting Buildings and Roads from UAV Aerial Remote Sensing Images Dec 16, 2022 Decoder Optical Character Recognition (OCR)
Code Code Available 05 A Tool for Facilitating OCR Postediting in Historical Documents Apr 23, 2020 Language Modeling Language Modelling
Code Code Available 05 A template-independent approach for information extraction in real estate documents May 30, 2023 Information Retrieval Natural Language Understanding
Code Code Available 05 Analyzing Green View Index and Green View Index best path using Google Street View and deep learning Apr 26, 2021 Optical Character Recognition (OCR) Semantic Segmentation
Code Code Available 05 GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding May 6, 2024 Contrastive Learning document understanding
Code Code Available 05 From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition Nov 15, 2018 Marketing Optical Character Recognition
Code Code Available 05 E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text Jan 30, 2018 Optical Character Recognition (OCR)
Code Code Available 05 Brno Mobile OCR Dataset Jul 2, 2019 Binarization Denoising
Code Code Available 05 From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction Oct 12, 2019 BIG-bench Machine Learning Machine Translation
Code Code Available 05 Early evidence of how LLMs outperform traditional systems on OCR/HTR tasks for historical records Jan 20, 2025 HTR Optical Character Recognition (OCR)
Code Code Available 05 Gated Recurrent Convolution Neural Network for OCR Dec 1, 2017 General Classification image-classification
Code Code Available 05 A Multi-Object Rectified Attention Network for Scene Text Recognition Jan 10, 2019 Decoder Object
Code Code Available 05 Handwriting Classification for the Analysis of Art-Historical Documents Nov 4, 2020 Classification General Classification
Code Code Available 05 EATEN: Entity-aware Attention for Single Shot Visual Text Extraction Sep 20, 2019 Decoder Entity Extraction using GAN
Code Code Available 05 Quantifying Character Similarity with Vision Transformers May 24, 2023 Optical Character Recognition (OCR)
Code Code Available 05 FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting Aug 27, 2024 Benchmarking Decoder
Code Code Available 05 FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings Aug 17, 2023 Image Retrieval Logo Recognition
Code Code Available 05 A Survey of Deep Learning Approaches for OCR and Document Understanding Nov 27, 2020 document understanding Optical Character Recognition (OCR)
Code Code Available 05