Chandojnanam: A Sanskrit Meter Identification and Utilization System Sep 29, 2022 Optical Character Recognition Optical Character Recognition (OCR)
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning Jul 17, 2025 Language Modeling Language Modelling
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding Jun 16, 2025 Optical Character Recognition (OCR) RAG
Ekush: A Multipurpose and Multitype Comprehensive Database for Online Off-Line Bangla Handwritten Characters Jul 17, 2019 Optical Character Recognition (OCR)
NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition Jun 4, 2018 Decoder Optical Character Recognition (OCR)
An Unsupervised Normalization Algorithm for Noisy Text: A Case Study for Information Retrieval and Stance Detection Jan 9, 2021 Information Retrieval Optical Character Recognition (OCR)
Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model Jan 9, 2025 Language Modeling Language Modelling
TransDocs: Optical Character Recognition with word to word translation Apr 15, 2023 Deep Learning Document Translation
SUT: a new multi-purpose synthetic dataset for Farsi document image analysis Nov 27, 2023 Document Classification document-image-classification
Object detection deep learning networks for Optical Character Recognition May 1, 2019 Deep Learning Document Classification
Efficient Video-Based ALPR System Using YOLO and Visual Rhythm Jan 4, 2025 License Plate Recognition Optical Character Recognition
Relation-Rich Visual Document Generator for Visual Information Extraction Apr 14, 2025 Diversity document understanding
A Survey of Deep Learning Approaches for OCR and Document Understanding Nov 27, 2020 document understanding Optical Character Recognition (OCR)
An Efficient and Layout-Independent Automatic License Plate Recognition System Based on the YOLO detector Sep 4, 2019 Data Augmentation GPU
Reproducibility, Replicability, and Insights into Visual Document Retrieval with Late Interaction May 12, 2025 Optical Character Recognition Optical Character Recognition (OCR)
Case Study of a highly automated Layout Analysis and OCR of an incunabulum: 'Der Heiligen Leben' (1488) Jan 20, 2017 Optical Character Recognition (OCR)
Crossing Language Borders: A Pipeline for Indonesian Manhwa Translation Jan 3, 2025 Machine Translation Object Detection
Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters Jan 1, 2024 Multi-Task Learning Optical Character Recognition
Transfer Learning for OCRopus Model Training on Early Printed Books Dec 15, 2017 Optical Character Recognition (OCR) Transfer Learning
Calibrated Structured Prediction Dec 1, 2015 Medical Diagnosis Optical Character Recognition
A Gaussian Process Upsampling Model for Improvements in Optical Character Recognition May 7, 2020 Optical Character Recognition Optical Character Recognition (OCR)
SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction Dec 5, 2024 Articles Dataset Generation
Syntactic Language Change in English and German: Metrics, Parsers, and Convergences Feb 18, 2024 Optical Character Recognition (OCR) Sentence
Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis Jan 8, 2025 License Plate Detection License Plate Recognition
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction Sep 20, 2019 Decoder Entity Extraction using GAN
Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering Mar 14, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Synthetic Document Question Answering in Hungarian May 29, 2025 Optical Character Recognition (OCR) Question Answering
Corpus for Coreference Resolution on Scientific Papers May 1, 2014 coreference-resolution Coreference Resolution
Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition Jul 5, 2018 GPU Optical Character Recognition
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing Sep 14, 2019 Optical Character Recognition (OCR) Semantic Parsing
Early evidence of how LLMs outperform traditional systems on OCR/HTR tasks for historical records Jan 20, 2025 HTR Optical Character Recognition (OCR)
Advancing Post-OCR Correction: A Comparative Study of Synthetic Data Aug 5, 2024 Optical Character Recognition (OCR) Synthetic Data Generation
Robust Scene Text Recognition with Automatic Rectification Mar 12, 2016 Optical Character Recognition (OCR) Scene Text Detection
Building a Part-of-Speech Tagged Corpus for Drenjongke (Bhutia) Dec 1, 2020 Optical Character Recognition (OCR) POS
Time-Aware Word Embeddings for Three Lebanese News Archives May 1, 2020 Optical Character Recognition (OCR) Word Embeddings
OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning May 22, 2025 Optical Character Recognition (OCR) Visual Reasoning
RoundTripOCR: A Data Generation Technique for Enhancing Post-OCR Error Correction in Low-Resource Devanagari Languages Dec 14, 2024 Machine Translation Optical Character Recognition
Convolution-based Probability Gradient Loss for Semantic Segmentation Apr 10, 2024 Optical Character Recognition (OCR) Semantic Segmentation
E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation May 9, 2023 Decoder Machine Translation
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss Jan 1, 2022 Optical Character Recognition Optical Character Recognition (OCR)
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text Jan 30, 2018 Optical Character Recognition (OCR)
Brno Mobile OCR Dataset Jul 2, 2019 Binarization Denoising
Comparative analysis of optical character recognition methods for Sámi texts from the National Library of Norway Jan 13, 2025 Optical Character Recognition Optical Character Recognition (OCR)
DuoSearch: A Novel Search Engine for Bulgarian Historical Documents May 30, 2023 Optical Character Recognition Optical Character Recognition (OCR)
ASTER: An Attentional Scene Text Recognizer with Flexible Rectification Jun 25, 2018 Optical Character Recognition Optical Character Recognition (OCR)
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents May 16, 2025 Form Language Modeling
Binary Document Image Super Resolution for Improved Readability and OCR Performance Dec 6, 2018 Image Super-Resolution Information Retrieval
A Skip-connected Multi-column Network for Isolated Handwritten Bangla Character and Digit recognition Apr 27, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Adapting the Tesseract Open-Source OCR Engine for Tamil and Sinhala Legacy Fonts and Creating a Parallel Corpus for Tamil-Sinhala-English Sep 13, 2021 Optical Character Recognition (OCR)
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction Mar 25, 2025 document understanding object-detection