Document Dewarping with Control Points Mar 20, 2022 Optical Character Recognition (OCR)
Code Code Available 1XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding Mar 14, 2022 document understanding Optical Character Recognition (OCR)
Code Code Available 1DiT: Self-supervised Pre-training for Document Image Transformer Mar 4, 2022 Document AI document-image-classification
Code Code Available 1TableFormer: Table Structure Understanding with Transformers Mar 2, 2022 Decoder object-detection
Code Code Available 1OCR-IDL: OCR Annotations for Industry Document Library Dataset Feb 25, 2022 Optical Character Recognition (OCR)
Code Code Available 1On the Cross-dataset Generalization in License Plate Recognition Jan 2, 2022 Data Augmentation License Plate Detection
Code Code Available 1LaTr: Layout-Aware Transformer for Scene-Text VQA Dec 23, 2021 Optical Character Recognition (OCR) Question Answering
Code Code Available 1An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images Dec 3, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Indian Licence Plate Dataset in the wild Nov 11, 2021 object-detection Object Detection
Code Code Available 1Lexically Aware Semi-Supervised Learning for OCR Post-Correction Nov 4, 2021 Language Modelling Optical Character Recognition
Code Code Available 1DocScanner: Robust Document Image Rectification with Progressive Learning Oct 28, 2021 Optical Character Recognition (OCR)
Code Code Available 1DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction Oct 25, 2021 Optical Character Recognition (OCR)
Code Code Available 1WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition Oct 7, 2021 Label Error Detection Optical Character Recognition
Code Code Available 1Rerunning OCR: A Machine Learning Approach to Quality Assessment and Enhancement Prediction Oct 4, 2021 BIG-bench Machine Learning Decision Making
Code Code Available 1TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models Sep 21, 2021 Handwritten Text Recognition Language Modeling
Code Code Available 1Post-OCR Document Correction with large Ensembles of Character Sequence-to-Sequence Models Sep 13, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents Aug 10, 2021 Key Information Extraction Language Modeling
Code Code Available 1Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents Aug 6, 2021 named-entity-recognition Named Entity Recognition
Code Code Available 1Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining Jul 15, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter Jun 10, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net Jun 2, 2021 Optical Character Recognition (OCR)
Code Code Available 1Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations May 23, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Unknown-box Approximation to Improve Optical Character Recognition Performance May 17, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions Apr 27, 2021 Optical Character Recognition (OCR)
Code Code Available 1Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model Apr 19, 2021 Language Modeling Language Modelling
Code Code Available 1Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages Apr 12, 2021 Machine Translation Multilingual NLP
Code Code Available 1Video-aided Unsupervised Grammar Induction Apr 9, 2021 Optical Character Recognition (OCR)
Code Code Available 1A Multiplexed Network for End-to-End, Multilingual OCR Mar 29, 2021 Optical Character Recognition (OCR) Text Detection
Code Code Available 1Combining Morphological and Histogram based Text Line Segmentation in the OCR Context Mar 16, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs Mar 15, 2021 Optical Character Recognition (OCR) Synthetic Data Generation
Code Code Available 1Neural OCR Post-Hoc Correction of Historical Corpora Feb 1, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Exploring Cross-Image Pixel Contrast for Semantic Segmentation Jan 28, 2021 Metric Learning Optical Character Recognition (OCR)
Code Code Available 1Iranis: A Large-scale Dataset of Farsi License Plate Characters Jan 1, 2021 image-classification Image Classification
Code Code Available 1FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems Dec 15, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1TAP: Text-Aware Pre-training for Text-VQA and Text-Caption Dec 8, 2020 Caption Generation Language Modeling
Code Code Available 1Confidence-aware Non-repetitive Multimodal Transformers for TextCaps Dec 7, 2020 Image Captioning Optical Character Recognition
Code Code Available 1A Two-Step Approach for Automatic OCR Post-Correction Dec 1, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Intrinsic Decomposition of Document Images In-the-Wild Nov 29, 2020 Document Shadow Removal Intrinsic Image Decomposition
Code Code Available 1OCR Post Correction for Endangered Language Texts Nov 10, 2020 Optical Character Recognition (OCR)
Code Code Available 1An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish Nov 6, 2020 Machine Translation NMT
Code Code Available 1RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering Oct 24, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1TLGAN: document Text Localization using Generative Adversarial Nets Oct 22, 2020 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement Oct 17, 2020 Binarization Deblurring
Code Code Available 1Tokenization Repair in the Presence of Spelling Errors Oct 15, 2020 Optical Character Recognition (OCR) Spelling Correction
Code Code Available 1Table Structure Recognition using Top-Down and Bottom-Up Cues Oct 9, 2020 Cell Detection Optical Character Recognition
Code Code Available 1A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes Oct 1, 2020 Multi-Label Classification Optical Character Recognition
Code Code Available 1A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching Sep 17, 2020 Deep Learning Entity Resolution
Code Code Available 1Adapting OCR with limited supervision Jul 27, 2020 Optical Character Recognition (OCR)
Code Code Available 1Spatially Aware Multimodal Transformers for TextVQA Jul 23, 2020 Optical Character Recognition (OCR) Spatial Reasoning
Code Code Available 1Attack of the Tails: Yes, You Really Can Backdoor Federated Learning Jul 9, 2020 Fairness Federated Learning
Code Code Available 1