LAMBERT: Layout-Aware (Language) Modeling for information extraction Feb 19, 2020 Key Information Extraction Language Modeling
Code Code Available 15 Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification Feb 16, 2023 Few-Shot Image Classification Few-Shot Learning
Code Code Available 15 Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text Images Nov 18, 2022 Deblurring Image Deblurring
Code Code Available 15 An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish Nov 6, 2020 Machine Translation NMT
Code Code Available 15 Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild Jul 23, 2022 Optical Character Recognition (OCR)
Code Code Available 15 DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction Dec 1, 2023 Optical Character Recognition (OCR)
Code Code Available 15 Indian Licence Plate Dataset in the wild Nov 11, 2021 object-detection Object Detection
Code Code Available 15 DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents Apr 24, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding Aug 27, 2024 document understanding Optical Character Recognition (OCR)
Code Code Available 15 DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding Jan 1, 2025 document understanding Optical Character Recognition (OCR)
Code Code Available 15 Improving accuracy and speeding up Document Image Classification through parallel systems Jun 16, 2020 Document Classification document-image-classification
Code Code Available 15 DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction Oct 25, 2021 Optical Character Recognition (OCR)
Code Code Available 15 NAT: Noise-Aware Training for Robust Neural Sequence Labeling May 14, 2020 Data Augmentation named-entity-recognition
Code Code Available 15 Document Dewarping with Control Points Mar 20, 2022 Optical Character Recognition (OCR)
Code Code Available 15 Intrinsic Decomposition of Document Images In-the-Wild Nov 29, 2020 Document Shadow Removal Intrinsic Image Decomposition
Code Code Available 15 Confidence-aware Non-repetitive Multimodal Transformers for TextCaps Dec 7, 2020 Image Captioning Optical Character Recognition
Code Code Available 15 Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter Jun 10, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 Iranis: A Large-scale Dataset of Farsi License Plate Characters Jan 1, 2021 image-classification Image Classification
Code Code Available 15 An Empirical Study of Scaling Law for OCR Dec 29, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection Mar 17, 2020 graph construction Optical Character Recognition (OCR)
Code Code Available 15 OCR-IDL: OCR Annotations for Industry Document Library Dataset Feb 25, 2022 Optical Character Recognition (OCR)
Code Code Available 15 OCR Post Correction for Endangered Language Texts Nov 10, 2020 Optical Character Recognition (OCR)
Code Code Available 15 Combining Morphological and Histogram based Text Line Segmentation in the OCR Context Mar 16, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 DSG: An End-to-End Document Structure Generator Oct 13, 2023 Optical Character Recognition (OCR)
Code Code Available 15 CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset Jun 6, 2024 object-detection Object Detection
Code Code Available 15 MathReader : Text-to-Speech for Mathematical Documents Jan 13, 2025 Optical Character Recognition (OCR) text-to-speech
Code Code Available 15 HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions Sep 18, 2022 object-detection Object Detection
Code Code Available 15 ClusterTabNet: Supervised clustering method for table detection and table structure recognition Feb 12, 2024 Clustering Optical Character Recognition (OCR)
Code Code Available 15 Hespi: A pipeline for automatically detecting information from hebarium specimen sheets Oct 11, 2024 Handwritten Text Recognition HTR
Code Code Available 15 Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks Dec 31, 2018 Handwriting Recognition License Plate Recognition
Code Code Available 15 An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images Dec 3, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks Jun 11, 2020 Optical Character Recognition (OCR) Text Detection
Code Code Available 15 hmBERT: Historical Multilingual Language Models for Named Entity Recognition May 31, 2022 Language Modeling Language Modelling
Code Code Available 15 ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages Mar 26, 2024 Machine Reading Comprehension Optical Character Recognition (OCR)
Code Code Available 15 Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents Aug 23, 2022 Optical Character Recognition (OCR) Table Extraction
Code Code Available 15 German Parliamentary Corpus (GerParCor) Apr 21, 2022 Optical Character Recognition (OCR)
Code Code Available 15 ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules Apr 5, 2023 Chart Understanding Derendering
Code Code Available 15 AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions Apr 27, 2021 Optical Character Recognition (OCR)
Code Code Available 15 CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models Apr 3, 2024 Optical Character Recognition (OCR) speech-recognition
Code Code Available 15 Image-based table recognition: data, model, and evaluation Nov 25, 2019 Articles Decoder
Code Code Available 15 Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval Jul 1, 2020 Optical Character Recognition (OCR) Retrieval
Code Code Available 15 From Text to Pixel: Advancing Long-Context Understanding in MLLMs May 23, 2024 Language Modeling Language Modelling
Code Code Available 15 FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions May 28, 2023 Attribute Image Captioning
Code Code Available 15 Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs Mar 15, 2021 Optical Character Recognition (OCR) Synthetic Data Generation
Code Code Available 15 Attack of the Tails: Yes, You Really Can Backdoor Federated Learning Jul 9, 2020 Fairness Federated Learning
Code Code Available 15 Geometry Restoration and Dewarping of Camera-Captured Document Images Jan 6, 2025 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 FlowLearn: Evaluating Large Vision-Language Models on Flowchart Understanding Jul 6, 2024 Optical Character Recognition (OCR) Visual Question Answering (VQA)
Code Code Available 15 A Two-Step Approach for Automatic OCR Post-Correction Dec 1, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 15 A Multiplexed Network for End-to-End, Multilingual OCR Mar 29, 2021 Optical Character Recognition (OCR) Text Detection
Code Code Available 15 BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents Aug 10, 2021 Key Information Extraction Language Modeling
Code Code Available 15