FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems Dec 15, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Exploring Better Text Image Translation with Multimodal Codebook May 27, 2023 Machine Translation Optical Character Recognition
Code Code Available 1Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks Dec 31, 2018 Handwriting Recognition License Plate Recognition
Code Code Available 1FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions May 28, 2023 Attribute Image Captioning
Code Code Available 1Attack of the Tails: Yes, You Really Can Backdoor Federated Learning Jul 9, 2020 Fairness Federated Learning
Code Code Available 1Exploring Cross-Image Pixel Contrast for Semantic Segmentation Jan 28, 2021 Metric Learning Optical Character Recognition (OCR)
Code Code Available 1GenPlot: Increasing the Scale and Diversity of Chart Derendering Data Jun 20, 2023 Derendering Diversity
Code Code Available 1Geometry Restoration and Dewarping of Camera-Captured Document Images Jan 6, 2025 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net Jun 2, 2021 Optical Character Recognition (OCR)
Code Code Available 1AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions Apr 27, 2021 Optical Character Recognition (OCR)
Code Code Available 1Hespi: A pipeline for automatically detecting information from hebarium specimen sheets Oct 11, 2024 Handwritten Text Recognition HTR
Code Code Available 1Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts Nov 16, 2024 Mixture-of-Experts Optical Character Recognition (OCR)
Code Code Available 1Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven Approach Aug 27, 2024 License Plate Recognition Optical Character Recognition
Code Code Available 1Image-text matching for large-scale book collections Jul 29, 2024 Image-text matching Optical Character Recognition (OCR)
Code Code Available 1Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation Oct 25, 2023 Handwritten Text Recognition Key Information Extraction
Code Code Available 1Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval Aug 1, 2024 Attribute Optical Character Recognition
Code Code Available 1DSG: An End-to-End Document Structure Generator Oct 13, 2023 Optical Character Recognition (OCR)
Code Code Available 1Document Dewarping with Control Points Mar 20, 2022 Optical Character Recognition (OCR)
Code Code Available 1bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents Aug 21, 2023 distortion correction Optical Character Recognition
Code Code Available 1LAMBERT: Layout-Aware (Language) Modeling for information extraction Feb 19, 2020 Key Information Extraction Language Modeling
Code Code Available 1DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction Oct 25, 2021 Optical Character Recognition (OCR)
Code Code Available 1EAST: An Efficient and Accurate Scene Text Detector Apr 11, 2017 Curved Text Detection Optical Character Recognition (OCR)
Code Code Available 1Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments Feb 10, 2025 Benchmarking Optical Character Recognition
Code Code Available 1Lexically Aware Semi-Supervised Learning for OCR Post-Correction Nov 4, 2021 Language Modelling Optical Character Recognition
Code Code Available 1DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents Apr 24, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules Apr 5, 2023 Chart Understanding Derendering
Code Code Available 1A Multiplexed Network for End-to-End, Multilingual OCR Mar 29, 2021 Optical Character Recognition (OCR) Text Detection
Code Code Available 1DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction Dec 1, 2023 Optical Character Recognition (OCR)
Code Code Available 1DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding Aug 27, 2024 document understanding Optical Character Recognition (OCR)
Code Code Available 1DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding Jan 1, 2025 document understanding Optical Character Recognition (OCR)
Code Code Available 1DocScanner: Robust Document Image Rectification with Progressive Learning Oct 28, 2021 Optical Character Recognition (OCR)
Code Code Available 1Easter2.0: Improving convolutional models for handwritten text recognition May 30, 2022 Data Augmentation Few-Shot Learning
Code Code Available 1Multimodal LLMs for OCR, OCR Post-Correction, and Named Entity Recognition in Historical Documents Apr 1, 2025 named-entity-recognition Named Entity Recognition
Code Code Available 1Modular Multimodal Machine Learning for Extraction of Theorems and Proofs in Long Scientific Documents (Extended Version) Jul 18, 2023 Articles Document AI
Code Code Available 1Digitizing Historical Balance Sheet Data: A Practitioner's Guide Mar 31, 2022 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Detection of Furigana Text in Images Jul 8, 2022 object-detection Object Detection
Code Code Available 1NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research Nov 15, 2022 Continual Learning Diversity
Code Code Available 1DiT: Self-supervised Pre-training for Document Image Transformer Mar 4, 2022 Document AI document-image-classification
Code Code Available 1DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement Oct 17, 2020 Binarization Deblurring
Code Code Available 1One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks Sep 20, 2024 All Dependency Parsing
Code Code Available 1Fully Unsupervised Diversity Denoising with Convolutional Variational Autoencoders Jun 10, 2020 Cell Segmentation Denoising
Code Code Available 1Data Generation for Post-OCR correction of Cyrillic handwriting Nov 27, 2023 Handwriting generation Handwritten Text Recognition
Code Code Available 1Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model Apr 19, 2021 Language Modeling Language Modelling
Code Code Available 1CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset Jun 6, 2024 object-detection Object Detection
Code Code Available 1Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection Mar 17, 2020 graph construction Optical Character Recognition (OCR)
Code Code Available 1Combining Morphological and Histogram based Text Line Segmentation in the OCR Context Mar 16, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching Sep 17, 2020 Deep Learning Entity Resolution
Code Code Available 1Post-OCR Document Correction with large Ensembles of Character Sequence-to-Sequence Models Sep 13, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition Dec 27, 2022 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Confidence-aware Non-repetitive Multimodal Transformers for TextCaps Dec 7, 2020 Image Captioning Optical Character Recognition
Code Code Available 1