An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images (Dec 3, 2021). Tasks: Optical Character Recognition (OCR)
End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net (Jun 2, 2021). Tasks: Optical Character Recognition (OCR)
Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks (Dec 31, 2018). Tasks: Handwriting Recognition, License Plate Recognition
Attack of the Tails: Yes, You Really Can Backdoor Federated Learning (Jul 9, 2020). Tasks: Fairness, Federated Learning
Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven Approach (Aug 27, 2024). Tasks: License Plate Recognition, Optical Character Recognition
Easter2.0: Improving convolutional models for handwritten text recognition (May 30, 2022). Tasks: Data Augmentation, Few-Shot Learning
Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs (Mar 15, 2021). Tasks: Optical Character Recognition (OCR), Synthetic Data Generation
GenKIE: Robust Generative Multimodal Document Key Information Extraction (Oct 24, 2023). Tasks: Decoder, Key Information Extraction
AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions (Apr 27, 2021). Tasks: Optical Character Recognition (OCR)
Efficient OCR for Building a Diverse Digital History (Apr 5, 2023). Tasks: Diversity, Image Retrieval
Exploring Better Text Image Translation with Multimodal Codebook (May 27, 2023). Tasks: Machine Translation, Optical Character Recognition
Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts (Nov 16, 2024). Tasks: Mixture-of-Experts, Optical Character Recognition (OCR)
FlowLearn: Evaluating Large Vision-Language Models on Flowchart Understanding (Jul 6, 2024). Tasks: Optical Character Recognition (OCR), Visual Question Answering (VQA)
hmBERT: Historical Multilingual Language Models for Named Entity Recognition (May 31, 2022). Tasks: Language Modeling
Document Dewarping with Control Points (Mar 20, 2022). Tasks: Optical Character Recognition (OCR)
DocScanner: Robust Document Image Rectification with Progressive Learning (Oct 28, 2021). Tasks: Optical Character Recognition (OCR)
Improving accuracy and speeding up Document Image Classification through parallel systems (Jun 16, 2020). Tasks: Document Classification, document-image-classification
Indian Licence Plate Dataset in the wild (Nov 11, 2021). Tasks: Object Detection
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction (Dec 1, 2023). Tasks: Optical Character Recognition (OCR)
Iranis: A Large-scale Dataset of Farsi License Plate Characters (Jan 1, 2021). Tasks: Image Classification
Large Scale Font Independent Urdu Text Recognition System (May 14, 2020). Tasks: Incremental Learning, Optical Character Recognition (OCR)
LaTr: Layout-Aware Transformer for Scene-Text VQA (Dec 23, 2021). Tasks: Optical Character Recognition (OCR), Question Answering
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction (Oct 25, 2021). Tasks: Optical Character Recognition (OCR)
Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text Images (Nov 18, 2022). Tasks: Deblurring, Image Deblurring
DSG: An End-to-End Document Structure Generator (Oct 13, 2023). Tasks: Optical Character Recognition (OCR)
DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding (Aug 27, 2024). Tasks: document understanding, Optical Character Recognition (OCR)
A Multiplexed Network for End-to-End, Multilingual OCR (Mar 29, 2021). Tasks: Optical Character Recognition (OCR), Text Detection
Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild (Jul 23, 2022). Tasks: Optical Character Recognition (OCR)
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding (Jan 1, 2025). Tasks: document understanding, Optical Character Recognition (OCR)
Fully Unsupervised Diversity Denoising with Convolutional Variational Autoencoders (Jun 10, 2020). Tasks: Cell Segmentation, Denoising
DocFormerv2: Local Features for Document Understanding (Jun 2, 2023). Tasks: Decoder, document understanding
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents (Apr 24, 2023). Tasks: Optical Character Recognition (OCR)
A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes (Oct 1, 2020). Tasks: Multi-Label Classification, Optical Character Recognition
Multimodal LLMs for OCR, OCR Post-Correction, and Named Entity Recognition in Historical Documents (Apr 1, 2025). Tasks: Named Entity Recognition
DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement (Oct 17, 2020). Tasks: Binarization, Deblurring
Detection of Furigana Text in Images (Jul 8, 2022). Tasks: Object Detection
Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection (Mar 17, 2020). Tasks: graph construction, Optical Character Recognition (OCR)
NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research (Nov 15, 2022). Tasks: Continual Learning, Diversity
One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks (Sep 20, 2024). Tasks: All, Dependency Parsing
Digitizing Historical Balance Sheet Data: A Practitioner's Guide (Mar 31, 2022). Tasks: Optical Character Recognition (OCR)
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps (Dec 7, 2020). Tasks: Image Captioning, Optical Character Recognition
Combining Morphological and Histogram based Text Line Segmentation in the OCR Context (Mar 16, 2021). Tasks: Optical Character Recognition (OCR)
CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset (Jun 6, 2024). Tasks: Object Detection
ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules (Apr 5, 2023). Tasks: Chart Understanding, Derendering
PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents (Mar 23, 2024). Tasks: Articles, Optical Character Recognition
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents (Aug 10, 2021). Tasks: Key Information Extraction, Language Modeling
PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks (Apr 16, 2020). Tasks: Graph Learning, Key Information Extraction
A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching (Sep 17, 2020). Tasks: Deep Learning, Entity Resolution
CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models (Apr 3, 2024). Tasks: Optical Character Recognition (OCR), speech-recognition
Data Generation for Post-OCR correction of Cyrillic handwriting (Nov 27, 2023). Tasks: Handwriting generation, Handwritten Text Recognition