Geometry Restoration and Dewarping of Camera-Captured Document Images Jan 6, 2025 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 German Parliamentary Corpus (GerParCor) Apr 21, 2022 Optical Character Recognition (OCR)
Code Code Available 15 One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks Sep 20, 2024 All Dependency Parsing
Code Code Available 15 Confidence-aware Non-repetitive Multimodal Transformers for TextCaps Dec 7, 2020 Image Captioning Optical Character Recognition
Code Code Available 15 Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents Aug 23, 2022 Optical Character Recognition (OCR) Table Extraction
Code Code Available 15 Towards Making Flowchart Images Machine Interpretable Jan 29, 2025 Code Generation Optical Character Recognition (OCR)
Code Code Available 15 CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models Apr 3, 2024 Optical Character Recognition (OCR) speech-recognition
Code Code Available 15 MCSCSet: A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction Oct 21, 2022 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish Nov 6, 2020 Machine Translation NMT
Code Code Available 15 HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions Sep 18, 2022 object-detection Object Detection
Code Code Available 15 ClusterTabNet: Supervised clustering method for table detection and table structure recognition Feb 12, 2024 Clustering Optical Character Recognition (OCR)
Code Code Available 15 Hespi: A pipeline for automatically detecting information from hebarium specimen sheets Oct 11, 2024 Handwritten Text Recognition HTR
Code Code Available 15 Unsupervised Audio-Visual Lecture Segmentation Oct 29, 2022 Navigate Optical Character Recognition (OCR)
Code Code Available 15 UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model Oct 8, 2023 Decoder Language Modeling
Code Code Available 15 A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching Sep 17, 2020 Deep Learning Entity Resolution
Code Code Available 15 DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction Oct 25, 2021 Optical Character Recognition (OCR)
Code Code Available 15 Image-text matching for large-scale book collections Jul 29, 2024 Image-text matching Optical Character Recognition (OCR)
Code Code Available 15 Image-based table recognition: data, model, and evaluation Nov 25, 2019 Articles Decoder
Code Code Available 15 PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents Mar 23, 2024 Articles Optical Character Recognition
Code Code Available 15 On Web-based Visual Corpus Construction for Visual Document Understanding Nov 7, 2022 document understanding Optical Character Recognition (OCR)
Code Code Available 15 LMV-RPA: Large Model Voting-based Robotic Process Automation Dec 23, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation Jun 12, 2024 Document Level Machine Translation Document Translation
Code Code Available 05 CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models Aug 30, 2024 Articles named-entity-recognition
Code Code Available 05 AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding Jun 16, 2025 Optical Character Recognition (OCR) RAG
Code Code Available 05 LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition May 23, 2022 Handwriting Recognition Knowledge Distillation
Code Code Available 05 Levenshtein OCR Sep 8, 2022 Imitation Learning Optical Character Recognition (OCR)
Code Code Available 05 Arrow-Guided VLM: Enhancing Flowchart Understanding via Arrow Direction Encoding May 9, 2025 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 A Data-driven Investigation of Euphemistic Language: Comparing the usage of "slave" and "servant" in 19th century US newspapers Mar 19, 2025 Optical Character Recognition (OCR)
Code Code Available 05 LEGAL-UQA: A Low-Resource Urdu-English Dataset for Legal Question Answering Oct 16, 2024 Optical Character Recognition (OCR) Question Answering
Code Code Available 05 Are VLMs Really Blind Oct 29, 2024 Language Modeling Language Modelling
Code Code Available 05 License Plate Detection and Recognition in Unconstrained Scenarios Sep 1, 2018 License Plate Detection License Plate Recognition
Code Code Available 05 Latent Tree Language Model Nov 1, 2016 Automatic Speech Recognition (ASR) Language Modeling
Code Code Available 05 LAREX - A semi-automatic open-source Tool for Layout Analysis and Region Extraction on Early Printed Books Jan 20, 2017 Optical Character Recognition (OCR)
Code Code Available 05 ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing Nov 20, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 ChemGrapher: Optical Graph Recognition of Chemical Compounds by Deep Learning Feb 23, 2020 Articles Deep Learning
Code Code Available 05 KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications Mar 21, 2025 16k 4k
Code Code Available 05 KAP: MLLM-assisted OCR Text Enhancement for Hybrid Retrieval in Chinese Non-Narrative Documents Mar 11, 2025 Optical Character Recognition (OCR) Retrieval
Code Code Available 05 Chinese Text in the Wild Feb 28, 2018 Optical Character Recognition (OCR)
Code Code Available 05 It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports Jan 22, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Optimal Projections for Discriminative Dictionary Learning using the JL-lemma Aug 27, 2023 Dictionary Learning Dimensionality Reduction
Code Code Available 05 Investigating OCR-Sensitive Neurons to Improve Entity Recognition in Historical Documents Sep 25, 2024 named-entity-recognition Named Entity Recognition
Code Code Available 05 Aligned Music Notation and Lyrics Transcription Dec 5, 2024 Language Modeling Language Modelling
Code Code Available 05 Jochre 3 and the Yiddish OCR corpus Jan 14, 2025 Optical Character Recognition (OCR)
Code Code Available 05 Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts Oct 22, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Alleviating Digitization Errors in Named Entity Recognition for Historical Documents Nov 1, 2020 named-entity-recognition Named Entity Recognition
Code Code Available 05 Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing Jun 1, 2025 Document AI document understanding
Code Code Available 05 Adapting the Tesseract Open Source OCR Engine for Multilingual OCR Jul 25, 2009 Optical Character Recognition (OCR)
Code Code Available 05 InstructOCR: Instruction Boosting Scene Text Spotting Dec 20, 2024 Optical Character Recognition (OCR) Text Spotting
Code Code Available 05 Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts Dec 15, 2019 Diversity Instance Segmentation
Code Code Available 05 Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription Feb 27, 2025 Handwritten Text Recognition HTR
Code Code Available 05