AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions Apr 27, 2021 Optical Character Recognition (OCR)
Code Code Available 15 Neural OCR Post-Hoc Correction of Historical Corpora Feb 1, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks Dec 31, 2018 Handwriting Recognition License Plate Recognition
Code Code Available 15 Data Generation for Post-OCR correction of Cyrillic handwriting Nov 27, 2023 Handwriting generation Handwritten Text Recognition
Code Code Available 15 Lexically Aware Semi-Supervised Learning for OCR Post-Correction Nov 4, 2021 Language Modelling Optical Character Recognition
Code Code Available 15 Multimodal LLMs for OCR, OCR Post-Correction, and Named Entity Recognition in Historical Documents Apr 1, 2025 named-entity-recognition Named Entity Recognition
Code Code Available 15 ClusterTabNet: Supervised clustering method for table detection and table structure recognition Feb 12, 2024 Clustering Optical Character Recognition (OCR)
Code Code Available 15 Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection Mar 17, 2020 graph construction Optical Character Recognition (OCR)
Code Code Available 15 Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents Aug 6, 2021 named-entity-recognition Named Entity Recognition
Code Code Available 15 CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models Apr 3, 2024 Optical Character Recognition (OCR) speech-recognition
Code Code Available 15 CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks Jun 11, 2020 Optical Character Recognition (OCR) Text Detection
Code Code Available 15 Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts Nov 16, 2024 Mixture-of-Experts Optical Character Recognition (OCR)
Code Code Available 15 ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages Mar 26, 2024 Machine Reading Comprehension Optical Character Recognition (OCR)
Code Code Available 15 DiT: Self-supervised Pre-training for Document Image Transformer Mar 4, 2022 Document AI document-image-classification
Code Code Available 15 A Multiplexed Network for End-to-End, Multilingual OCR Mar 29, 2021 Optical Character Recognition (OCR) Text Detection
Code Code Available 15 Digitizing Historical Balance Sheet Data: A Practitioner's Guide Mar 31, 2022 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering Jun 1, 2023 Optical Character Recognition (OCR) Question Answering
Code Code Available 15 Large Scale Font Independent Urdu Text Recognition System May 14, 2020 Incremental Learning Optical Character Recognition (OCR)
Code Code Available 15 bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents Aug 21, 2023 distortion correction Optical Character Recognition
Code Code Available 15 DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding Aug 27, 2024 document understanding Optical Character Recognition (OCR)
Code Code Available 15 DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction Dec 1, 2023 Optical Character Recognition (OCR)
Code Code Available 15 DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents Apr 24, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments Feb 10, 2025 Benchmarking Optical Character Recognition
Code Code Available 15 DocScanner: Robust Document Image Rectification with Progressive Learning Oct 28, 2021 Optical Character Recognition (OCR)
Code Code Available 15 LaTr: Layout-Aware Transformer for Scene-Text VQA Dec 23, 2021 Optical Character Recognition (OCR) Question Answering
Code Code Available 15 Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text Images Nov 18, 2022 Deblurring Image Deblurring
Code Code Available 15 Intrinsic Decomposition of Document Images In-the-Wild Nov 29, 2020 Document Shadow Removal Intrinsic Image Decomposition
Code Code Available 15 Indian Licence Plate Dataset in the wild Nov 11, 2021 object-detection Object Detection
Code Code Available 15 Iranis: A Large-scale Dataset of Farsi License Plate Characters Jan 1, 2021 image-classification Image Classification
Code Code Available 15 ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules Apr 5, 2023 Chart Understanding Derendering
Code Code Available 15 One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks Sep 20, 2024 All Dependency Parsing
Code Code Available 15 PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model Mar 24, 2025 Language Modeling Language Modelling
Code Code Available 15 Improving accuracy and speeding up Document Image Classification through parallel systems Jun 16, 2020 Document Classification document-image-classification
Code Code Available 15 Privacy-Aware Document Visual Question Answering Dec 15, 2023 document understanding Federated Learning
Code Code Available 15 Image-text matching for large-scale book collections Jul 29, 2024 Image-text matching Optical Character Recognition (OCR)
Code Code Available 15 EAST: An Efficient and Accurate Scene Text Detector Apr 11, 2017 Curved Text Detection Optical Character Recognition (OCR)
Code Code Available 15 Combining Morphological and Histogram based Text Line Segmentation in the OCR Context Mar 16, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 Rerunning OCR: A Machine Learning Approach to Quality Assessment and Enhancement Prediction Oct 4, 2021 BIG-bench Machine Learning Decision Making
Code Code Available 15 Efficient OCR for Building a Diverse Digital History Apr 5, 2023 Diversity Image Retrieval
Code Code Available 15 Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining Jul 15, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching Sep 17, 2020 Deep Learning Entity Resolution
Code Code Available 15 Image-based table recognition: data, model, and evaluation Nov 25, 2019 Articles Decoder
Code Code Available 15 FigStep: Jailbreaking Large Vision-Language Models via Typographic Visual Prompts Nov 9, 2023 Optical Character Recognition (OCR) Safety Alignment
Code Code Available 15 From Text to Pixel: Advancing Long-Context Understanding in MLLMs May 23, 2024 Language Modeling Language Modelling
Code Code Available 15 Exploring Cross-Image Pixel Contrast for Semantic Segmentation Jan 28, 2021 Metric Learning Optical Character Recognition (OCR)
Code Code Available 15 BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents Aug 10, 2021 Key Information Extraction Language Modeling
Code Code Available 15 Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation Oct 25, 2023 Handwritten Text Recognition Key Information Extraction
Code Code Available 15 Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition Nov 2, 2018 Decoder Irregular Text Recognition
Code Code Available 15 Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter Jun 10, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 LAMBERT: Layout-Aware (Language) Modeling for information extraction Feb 19, 2020 Key Information Extraction Language Modeling
Code Code Available 15