Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval Jul 1, 2020 Optical Character Recognition (OCR) Retrieval
Code Code Available 1Improving accuracy and speeding up Document Image Classification through parallel systems Jun 16, 2020 Document Classification document-image-classification
Code Code Available 1CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks Jun 11, 2020 Optical Character Recognition (OCR) Text Detection
Code Code Available 1Fully Unsupervised Diversity Denoising with Convolutional Variational Autoencoders Jun 10, 2020 Cell Segmentation Denoising
Code Code Available 1Structured Multimodal Attentions for TextVQA Jun 1, 2020 Graph Attention Optical Character Recognition (OCR)
Code Code Available 1SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition May 22, 2020 Decoder Optical Character Recognition (OCR)
Code Code Available 1Large Scale Font Independent Urdu Text Recognition System May 14, 2020 Incremental Learning Optical Character Recognition (OCR)
Code Code Available 1NAT: Noise-Aware Training for Robust Neural Sequence Labeling May 14, 2020 Data Augmentation named-entity-recognition
Code Code Available 1The Newspaper Navigator Dataset: Extracting And Analyzing Visual Content from 16 Million Historic Newspaper Pages in Chronicling America May 4, 2020 Optical Character Recognition (OCR)
Code Code Available 1PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks Apr 16, 2020 Graph Learning Key Information Extraction
Code Code Available 1ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation Mar 23, 2020 Domain Adaptation Handwriting generation
Code Code Available 1Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection Mar 17, 2020 graph construction Optical Character Recognition (OCR)
Code Code Available 1LAMBERT: Layout-Aware (Language) Modeling for information extraction Feb 19, 2020 Key Information Extraction Language Modeling
Code Code Available 1Image-based table recognition: data, model, and evaluation Nov 25, 2019 Articles Decoder
Code Code Available 1FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents May 27, 2019 Form Optical Character Recognition
Code Code Available 1Shape Robust Text Detection with Progressive Scale Expansion Network Mar 28, 2019 Optical Character Recognition (OCR) Scene Text Detection
Code Code Available 1Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks Dec 31, 2018 Handwriting Recognition License Plate Recognition
Code Code Available 1Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition Nov 2, 2018 Decoder Irregular Text Recognition
Code Code Available 1A Robust Real-Time Automatic License Plate Recognition Based on the YOLO Detector Feb 26, 2018 Data Augmentation License Plate Detection
Code Code Available 1EAST: An Efficient and Accurate Scene Text Detector Apr 11, 2017 Curved Text Detection Optical Character Recognition (OCR)
Code Code Available 1VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning Jul 17, 2025 Language Modeling Language Modelling
Code Code Available 0DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment Jul 17, 2025 Document Image Quality Assessment Image Quality Assessment
Code Code Available 0Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis Jul 15, 2025 Marketing Optical Character Recognition
— Unverified 0A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends Jul 14, 2025 document understanding Optical Character Recognition
— Unverified 0Design and Implementation of an OCR-Powered Pipeline for Table Extraction from Invoices Jul 9, 2025 Boundary Detection Optical Character Recognition (OCR)
— Unverified 0Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning Jul 9, 2025 Benchmarking Image Retrieval
Code Code Available 0PaddleOCR 3.0 Technical Report Jul 8, 2025 document understanding Key Information Extraction
— Unverified 0TextPixs: Glyph-Conditioned Diffusion with Character-Aware Attention and OCR-Guided Supervision Jul 8, 2025 Image Generation Optical Character Recognition (OCR)
— Unverified 0DrishtiKon: Multi-Granular Visual Grounding for Text-Rich Document Images Jun 26, 2025 document understanding Optical Character Recognition (OCR)
Code Code Available 0Logios : An open source Greek Polytonic Optical Character Recognition system Jun 26, 2025 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0Engineering RAG Systems for Real-World Applications: Design, Development, and Evaluation Jun 25, 2025 Optical Character Recognition (OCR) RAG
— Unverified 0Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models Jun 25, 2025 document understanding Hallucination
— Unverified 0Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages Jun 22, 2025 image-classification Image Classification
— Unverified 0An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW Jun 18, 2025 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0FormGym: Doing Paperwork with Agents Jun 17, 2025 Form Information Retrieval
— Unverified 0AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding Jun 16, 2025 Optical Character Recognition (OCR) RAG
Code Code Available 0MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation Jun 16, 2025 Optical Character Recognition (OCR)
— Unverified 0Efficient Medical VIE via Reinforcement Learning Jun 16, 2025 Diversity Optical Character Recognition (OCR)
— Unverified 0Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers Jun 12, 2025 Hallucination Optical Character Recognition (OCR)
— Unverified 0Intelligent Automation for FDI Facilitation: Optimizing Tariff Exemption Processes with OCR And Large Language Models Jun 12, 2025 Large Language Model Optical Character Recognition
— Unverified 0The OCR Quest for Generalization: Learning to recognize low-resource alphabets with model editing Jun 7, 2025 Meta-Learning Model Editing
— Unverified 0Reading in the Dark with Foveated Event Vision Jun 7, 2025 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions Jun 5, 2025 Computational Efficiency document understanding
— Unverified 0Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing Jun 1, 2025 Document AI document understanding
Code Code Available 0Predicting the Past: Estimating Historical Appraisals with OCR and Machine Learning May 30, 2025 Optical Character Recognition (OCR)
Code Code Available 0SARD: A Large-Scale Synthetic Arabic OCR Dataset for Book-Style Text Recognition May 30, 2025 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0Synthetic Document Question Answering in Hungarian May 29, 2025 Optical Character Recognition (OCR) Question Answering
Code Code Available 0ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering May 29, 2025 Chart Question Answering Chart Understanding
— Unverified 0TextSR: Diffusion Super-Resolution with Multilingual OCR Guidance May 29, 2025 Image Super-Resolution Optical Character Recognition
— Unverified 0E2E Process Automation Leveraging Generative AI and IDP-Based Automation Agent: A Case Study on Corporate Expense Processing May 27, 2025 Decision Making Optical Character Recognition (OCR)
— Unverified 0