AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions Apr 27, 2021 Optical Character Recognition (OCR)
Code Code Available 15 Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild Jul 23, 2022 Optical Character Recognition (OCR)
Code Code Available 15 Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification Feb 16, 2023 Few-Shot Image Classification Few-Shot Learning
Code Code Available 15 NAT: Noise-Aware Training for Robust Neural Sequence Labeling May 14, 2020 Data Augmentation named-entity-recognition
Code Code Available 15 Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text Images Nov 18, 2022 Deblurring Image Deblurring
Code Code Available 15 Data Generation for Post-OCR correction of Cyrillic handwriting Nov 27, 2023 Handwriting generation Handwritten Text Recognition
Code Code Available 15 A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes Oct 1, 2020 Multi-Label Classification Optical Character Recognition
Code Code Available 15 Multimodal LLMs for OCR, OCR Post-Correction, and Named Entity Recognition in Historical Documents Apr 1, 2025 named-entity-recognition Named Entity Recognition
Code Code Available 15 Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering Jun 1, 2023 Optical Character Recognition (OCR) Question Answering
Code Code Available 15 CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks Jun 11, 2020 Optical Character Recognition (OCR) Text Detection
Code Code Available 15 ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages Mar 26, 2024 Machine Reading Comprehension Optical Character Recognition (OCR)
Code Code Available 15 Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts Nov 16, 2024 Mixture-of-Experts Optical Character Recognition (OCR)
Code Code Available 15 LaTr: Layout-Aware Transformer for Scene-Text VQA Dec 23, 2021 Optical Character Recognition (OCR) Question Answering
Code Code Available 15 Detection of Furigana Text in Images Jul 8, 2022 object-detection Object Detection
Code Code Available 15 Lexically Aware Semi-Supervised Learning for OCR Post-Correction Nov 4, 2021 Language Modelling Optical Character Recognition
Code Code Available 15 A Multiplexed Network for End-to-End, Multilingual OCR Mar 29, 2021 Optical Character Recognition (OCR) Text Detection
Code Code Available 15 DiT: Self-supervised Pre-training for Document Image Transformer Mar 4, 2022 Document AI document-image-classification
Code Code Available 15 Digitizing Historical Balance Sheet Data: A Practitioner's Guide Mar 31, 2022 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents Aug 21, 2023 distortion correction Optical Character Recognition
Code Code Available 15 Fully Unsupervised Diversity Denoising with Convolutional Variational Autoencoders Jun 10, 2020 Cell Segmentation Denoising
Code Code Available 15 LAMBERT: Layout-Aware (Language) Modeling for information extraction Feb 19, 2020 Key Information Extraction Language Modeling
Code Code Available 15 DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding Aug 27, 2024 document understanding Optical Character Recognition (OCR)
Code Code Available 15 Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments Feb 10, 2025 Benchmarking Optical Character Recognition
Code Code Available 15 DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents Apr 24, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 ClusterTabNet: Supervised clustering method for table detection and table structure recognition Feb 12, 2024 Clustering Optical Character Recognition (OCR)
Code Code Available 15 Iranis: A Large-scale Dataset of Farsi License Plate Characters Jan 1, 2021 image-classification Image Classification
Code Code Available 15 Large Scale Font Independent Urdu Text Recognition System May 14, 2020 Incremental Learning Optical Character Recognition (OCR)
Code Code Available 15 Efficient OCR for Building a Diverse Digital History Apr 5, 2023 Diversity Image Retrieval
Code Code Available 15 Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents Aug 6, 2021 named-entity-recognition Named Entity Recognition
Code Code Available 15 Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter Jun 10, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 Improving accuracy and speeding up Document Image Classification through parallel systems Jun 16, 2020 Document Classification document-image-classification
Code Code Available 15 PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks Apr 16, 2020 Graph Learning Key Information Extraction
Code Code Available 15 Image-text matching for large-scale book collections Jul 29, 2024 Image-text matching Optical Character Recognition (OCR)
Code Code Available 15 One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks Sep 20, 2024 All Dependency Parsing
Code Code Available 15 Indian Licence Plate Dataset in the wild Nov 11, 2021 object-detection Object Detection
Code Code Available 15 Hespi: A pipeline for automatically detecting information from hebarium specimen sheets Oct 11, 2024 Handwritten Text Recognition HTR
Code Code Available 15 DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement Oct 17, 2020 Binarization Deblurring
Code Code Available 15 Rerunning OCR: A Machine Learning Approach to Quality Assessment and Enhancement Prediction Oct 4, 2021 BIG-bench Machine Learning Decision Making
Code Code Available 15 EAST: An Efficient and Accurate Scene Text Detector Apr 11, 2017 Curved Text Detection Optical Character Recognition (OCR)
Code Code Available 15 Easter2.0: Improving convolutional models for handwritten text recognition May 30, 2022 Data Augmentation Few-Shot Learning
Code Code Available 15 hmBERT: Historical Multilingual Language Models for Named Entity Recognition May 31, 2022 Language Modeling Language Modelling
Code Code Available 15 ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules Apr 5, 2023 Chart Understanding Derendering
Code Code Available 15 End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net Jun 2, 2021 Optical Character Recognition (OCR)
Code Code Available 15 Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval Jul 1, 2020 Optical Character Recognition (OCR) Retrieval
Code Code Available 15 A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching Sep 17, 2020 Deep Learning Entity Resolution
Code Code Available 15 BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents Aug 10, 2021 Key Information Extraction Language Modeling
Code Code Available 15 SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels Dec 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven Approach Aug 27, 2024 License Plate Recognition Optical Character Recognition
Code Code Available 15 A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition Dec 27, 2022 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models Apr 3, 2024 Optical Character Recognition (OCR) speech-recognition
Code Code Available 15