GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding May 6, 2024 Contrastive Learning document understanding
Code Code Available 0Gated Recurrent Convolution Neural Network for OCR Dec 1, 2017 General Classification image-classification
Code Code Available 0SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition Feb 17, 2021 Handwriting Recognition Handwritten Text Recognition
Code Code Available 0Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation Jul 9, 2024 Decoder Image Generation
Code Code Available 0Mobile User Interface Element Detection Via Adaptively Prompt Tuning May 16, 2023 object-detection Object Detection
Code Code Available 0DDI-100: Dataset for Text Detection and Recognition Dec 25, 2019 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition Nov 15, 2018 Marketing Optical Character Recognition
Code Code Available 0AiM: Taking Answers in Mind to Correct Chinese Cloze Tests in Educational Applications Aug 26, 2022 Optical Character Recognition (OCR)
Code Code Available 0A Tool for Facilitating OCR Postediting in Historical Documents Apr 23, 2020 Language Modeling Language Modelling
Code Code Available 0ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing Nov 20, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction Oct 12, 2019 BIG-bench Machine Learning Machine Translation
Code Code Available 0MRZ code extraction from visa and passport documents using convolutional neural networks Sep 11, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Word-Level Alignment of Paper Documents with their Electronic Full-Text Counterparts Apr 30, 2021 Optical Character Recognition (OCR)
Code Code Available 0ChemGrapher: Optical Graph Recognition of Chemical Compounds by Deep Learning Feb 23, 2020 Articles Deep Learning
Code Code Available 0A template-independent approach for information extraction in real estate documents May 30, 2023 Information Retrieval Natural Language Understanding
Code Code Available 0Multi-Granularity Prediction for Scene Text Recognition Sep 8, 2022 Language Modeling Language Modelling
Code Code Available 0PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language May 15, 2025 Benchmarking Optical Character Recognition
Code Code Available 0A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check Oct 1, 2018 Language Modeling Language Modelling
Code Code Available 0State of the Art Optical Character Recognition of 19th Century Fraktur Scripts using Open Source Engines Oct 8, 2018 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs Jul 11, 2018 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0TF-LM: TensorFlow-based Language Modeling Toolkit May 1, 2018 Language Modeling Language Modelling
Code Code Available 0Multimodal deep networks for text and image-based document classification Jul 15, 2019 Classification Document Classification
Code Code Available 0FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting Aug 27, 2024 Benchmarking Decoder
Code Code Available 0FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings Aug 17, 2023 Image Retrieval Logo Recognition
Code Code Available 0Multi-modal Page Stream Segmentation with Convolutional Neural Networks Sep 27, 2019 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Quantifying Character Similarity with Vision Transformers May 24, 2023 Optical Character Recognition (OCR)
Code Code Available 0Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline May 16, 2025 Abstractive Text Summarization Language Modeling
Code Code Available 0Are VLMs Really Blind Oct 29, 2024 Language Modeling Language Modelling
Code Code Available 0DCQA: Document-Level Chart Question Answering towards Complex Reasoning and Common-Sense Understanding Oct 29, 2023 Answer Generation Chart Question Answering
Code Code Available 0MultiOCR-QA: Dataset for Evaluating Robustness of LLMs in Question Answering on Multilingual OCR Texts Feb 24, 2025 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism Apr 29, 2024 document understanding GPU
Code Code Available 0STEP -- Towards Structured Scene-Text Spotting Sep 5, 2023 Optical Character Recognition (OCR) Scene Text Detection
Code Code Available 0Reading Between the Mud: A Challenging Motorcycle Racer Number Dataset Nov 14, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0AON: Towards Arbitrarily-Oriented Text Recognition Nov 12, 2017 Decoder Optical Character Recognition
Code Code Available 0Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models Feb 18, 2025 Image to text Optical Character Recognition
Code Code Available 0Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models Apr 16, 2025 document understanding Layout Design
Code Code Available 0Enhancing Assamese NLP Capabilities: Introducing a Centralized Dataset Repository Oct 15, 2024 Diversity Machine Translation
Code Code Available 0STN-OCR: A single Neural Network for Text Detection and Text Recognition Jul 27, 2017 Optical Character Recognition (OCR) Scene Text Detection
Code Code Available 0Character decomposition to resolve class imbalance problem in Hangul OCR Aug 12, 2022 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Upcycle Your OCR: Reusing OCRs for Post-OCR Text Correction in Romanised Sanskrit Sep 6, 2018 Optical Character Recognition (OCR)
Code Code Available 0NASS-AI: Towards Digitization of Parliamentary Bills using Document Level Embedding and Bidirectional Long Short-Term Memory Oct 2, 2019 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0End-to-End Optical Character Recognition for Bengali Handwritten Words May 9, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Data-Driven Spelling Correction using Weighted Finite-State Methods Aug 1, 2016 Optical Character Recognition (OCR) Spelling Correction
Code Code Available 0Data Centric Domain Adaptation for Historical Text with OCR Errors Jul 2, 2021 Cross-Domain Named Entity Recognition Domain Adaptation
Code Code Available 0End-to-End Interpretation of the French Street Name Signs Dataset Feb 13, 2017 Optical Character Recognition (OCR)
Code Code Available 0Empirical Error Modeling Improves Robustness of Noisy Neural Sequence Labeling May 25, 2021 Language Modeling Language Modelling
Code Code Available 0Stroke extraction for offline handwritten mathematical expression recognition May 16, 2019 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training Mar 1, 2023 Document Image Classification image-classification
Code Code Available 0Noisy Parallel Data Alignment Jan 23, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues Dec 17, 2024 Language Modeling Language Modelling
Code Code Available 0