Character decomposition to resolve class imbalance problem in Hangul OCR Aug 12, 2022 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Chandojnanam: A Sanskrit Meter Identification and Utilization System Sep 29, 2022 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Mining Spatio-temporal Data on Industrialization from Historical Registries Dec 3, 2016 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features Sep 25, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 AON: Towards Arbitrarily-Oriented Text Recognition Nov 12, 2017 Decoder Optical Character Recognition
Code Code Available 05 Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model Jan 9, 2025 Language Modeling Language Modelling
Code Code Available 05 AiM: Taking Answers in Mind to Correct Chinese Cloze Tests in Educational Applications Aug 26, 2022 Optical Character Recognition (OCR)
Code Code Available 05 Measuring Intersectional Biases in Historical Documents May 21, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 An Unsupervised Normalization Algorithm for Noisy Text: A Case Study for Information Retrieval and Stance Detection Jan 9, 2021 Information Retrieval Optical Character Recognition (OCR)
Code Code Available 05 Case Study of a highly automated Layout Analysis and OCR of an incunabulum: 'Der Heiligen Leben' (1488) Jan 20, 2017 Optical Character Recognition (OCR)
Code Code Available 05 An Unsupervised Model of Orthographic Variation for Historical Document Transcription Jun 1, 2016 Optical Character Recognition (OCR)
Code Code Available 05 M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation Jun 12, 2024 Document Level Machine Translation Document Translation
Code Code Available 05 LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition May 23, 2022 Handwriting Recognition Knowledge Distillation
Code Code Available 05 LMV-RPA: Large Model Voting-based Robotic Process Automation Dec 23, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Calibrated Structured Prediction Dec 1, 2015 Medical Diagnosis Optical Character Recognition
Code Code Available 05 Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition Jul 5, 2018 GPU Optical Character Recognition
Code Code Available 05 Answering Questions about Data Visualizations using Efficient Bimodal Fusion Aug 5, 2019 Chart Question Answering Optical Character Recognition
Code Code Available 05 LEGAL-UQA: A Low-Resource Urdu-English Dataset for Legal Question Answering Oct 16, 2024 Optical Character Recognition (OCR) Question Answering
Code Code Available 05 Levenshtein OCR Sep 8, 2022 Imitation Learning Optical Character Recognition (OCR)
Code Code Available 05 Building a Part-of-Speech Tagged Corpus for Drenjongke (Bhutia) Dec 1, 2020 Optical Character Recognition (OCR) POS
Code Code Available 05 Latent Tree Language Model Nov 1, 2016 Automatic Speech Recognition (ASR) Language Modeling
Code Code Available 05 LOANet: A Lightweight Network Using Object Attention for Extracting Buildings and Roads from UAV Aerial Remote Sensing Images Dec 16, 2022 Decoder Optical Character Recognition (OCR)
Code Code Available 05 KAP: MLLM-assisted OCR Text Enhancement for Hybrid Retrieval in Chinese Non-Narrative Documents Mar 11, 2025 Optical Character Recognition (OCR) Retrieval
Code Code Available 05 KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications Mar 21, 2025 16k 4k
Code Code Available 05 A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check Oct 1, 2018 Language Modeling Language Modelling
Code Code Available 05 Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription Feb 27, 2025 Handwritten Text Recognition HTR
Code Code Available 05 Brno Mobile OCR Dataset Jul 2, 2019 Binarization Denoising
Code Code Available 05 An Open Source Contractual Language Understanding Application Using Machine Learning Jun 1, 2022 Document Text Classification Information Retrieval
Code Code Available 05 Optimal Projections for Discriminative Dictionary Learning using the JL-lemma Aug 27, 2023 Dictionary Learning Dimensionality Reduction
Code Code Available 05 Advancing Post-OCR Correction: A Comparative Study of Synthetic Data Aug 5, 2024 Optical Character Recognition (OCR) Synthetic Data Generation
Code Code Available 05 It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports Jan 22, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Jochre 3 and the Yiddish OCR corpus Jan 14, 2025 Optical Character Recognition (OCR)
Code Code Available 05 LAREX - A semi-automatic open-source Tool for Layout Analysis and Region Extraction on Early Printed Books Jan 20, 2017 Optical Character Recognition (OCR)
Code Code Available 05 License Plate Detection and Recognition in Unconstrained Scenarios Sep 1, 2018 License Plate Detection License Plate Recognition
Code Code Available 05 Mobile User Interface Element Detection Via Adaptively Prompt Tuning May 16, 2023 object-detection Object Detection
Code Code Available 05 Order-preserving Consistency Regularization for Domain Adaptation and Generalization Sep 23, 2023 Data Augmentation Domain Adaptation
Code Code Available 05 Improving patch-based scene text script identification with ensembles of conjoined networks Feb 24, 2016 General Classification Optical Character Recognition (OCR)
Code Code Available 05 Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts Dec 15, 2019 Diversity Instance Segmentation
Code Code Available 05 Improving OCR Accuracy on Early Printed Books using Deep Convolutional Networks Feb 27, 2018 Optical Character Recognition (OCR)
Code Code Available 05 An OCR system for the Unified Northern Alphabet Jan 1, 2019 Optical Character Recognition (OCR)
Code Code Available 05 Improving OCR Accuracy on Early Printed Books by utilizing Cross Fold Training and Voting Nov 27, 2017 Optical Character Recognition (OCR)
Code Code Available 05 Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing Jun 1, 2025 Document AI document understanding
Code Code Available 05 Implicit Language Model in LSTM for OCR May 23, 2018 Language Modeling Language Modelling
Code Code Available 05 A Gaussian Process Upsampling Model for Improvements in Optical Character Recognition May 7, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 Binary Document Image Super Resolution for Improved Readability and OCR Performance Dec 6, 2018 Image Super-Resolution Information Retrieval
Code Code Available 05 Improving OCR Accuracy on Early Printed Books by combining Pretraining, Voting, and Active Learning Feb 27, 2018 Active Learning Optical Character Recognition (OCR)
Code Code Available 05 InstructOCR: Instruction Boosting Scene Text Spotting Dec 20, 2024 Optical Character Recognition (OCR) Text Spotting
Code Code Available 05 An Evaluation of OCR on Egocentric Data Jun 11, 2022 Optical Character Recognition (OCR)
Code Code Available 05 BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction Mar 25, 2025 document understanding object-detection
Code Code Available 05 iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition Jun 27, 2022 Face Detection Face Recognition
Code Code Available 05