| Named Entity Recognition in Historic Legal Text: A Transformer and State Machine Ensemble Method | Nov 1, 2021 | Language Modeling | Unverified | 0 |
| Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts | Oct 22, 2021 | Optical Character Recognition (OCR) | Code Available | 0 |
| Optical Character Recognition of 19th Century Classical Commentaries: the Current State of Affairs | Oct 13, 2021 | Optical Character Recognition (OCR) | Code Available | 0 |
| WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition | Oct 7, 2021 | Label Error Detection, Optical Character Recognition | Code Available | 1 |
| A Proposal of Automatic Error Correction in Text | Sep 24, 2021 | Information Retrieval, Language Modelling | Unverified | 0 |
| TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models | Sep 21, 2021 | Handwritten Text Recognition, Language Modeling | Code Available | 1 |
| Deep learning-based NLP Data Pipeline for EHR Scanned Document Information Extraction | Sep 14, 2021 | Optical Character Recognition (OCR) | Unverified | 0 |
| Post-OCR Document Correction with large Ensembles of Character Sequence-to-Sequence Models | Sep 13, 2021 | Optical Character Recognition (OCR) | Code Available | 1 |
| PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System | Sep 7, 2021 | Optical Character Recognition (OCR) | Code Available | 2 |
| A Novel Machine Learning Based Approach for Post-OCR Error Detection | Sep 1, 2021 | BIG-bench Machine Learning, Optical Character Recognition | Unverified | 0 |
| OCR Processing of Swedish Historical Newspapers Using Deep Hybrid CNN–LSTM Networks | Sep 1, 2021 | Optical Character Recognition (OCR) | Unverified | 0 |
| A Multimodal Framework for Video Ads Understanding | Aug 29, 2021 | Marketing, Optical Character Recognition | Unverified | 0 |
| Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling | Aug 20, 2021 | Data Ablation, Optical Character Recognition | Unverified | 0 |
| VisBuddy -- A Smart Wearable Assistant for the Visually Challenged | Aug 17, 2021 | Image Captioning, Object Detection | Unverified | 0 |
| Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents | Aug 6, 2021 | Named Entity Recognition | Code Available | 1 |
| Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining | Jul 15, 2021 | Optical Character Recognition (OCR) | Code Available | 1 |
| An End-to-End Khmer Optical Character Recognition using Sequence-to-Sequence with Attention | Jun 21, 2021 | Decoder, Optical Character Recognition | Unverified | 0 |
| Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences | Jun 20, 2021 | Decoder, Optical Character Recognition | Unverified | 0 |
| Classification of Documents Extracted from Images with Optical Character Recognition Methods | Jun 15, 2021 | BIG-bench Machine Learning, Optical Character Recognition | Unverified | 0 |
| Mixed Model OCR Training on Historical Latin Script for Out-of-the-Box Recognition and Finetuning | Jun 15, 2021 | Data Augmentation, Optical Character Recognition | Unverified | 0 |
| Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter | Jun 10, 2021 | Optical Character Recognition (OCR) | Code Available | 1 |
| Classification of Contract-Amendment Relationships | Jun 8, 2021 | Classification, Management | Unverified | 0 |
| PAM: Understanding Product Images in Cross Product Category Attribute Extraction | Jun 8, 2021 | Attribute, Attribute Extraction | Unverified | 0 |
| Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning Based Methods | May 31, 2021 | Articles, BIG-bench Machine Learning | Unverified | 0 |
| Empirical Error Modeling Improves Robustness of Noisy Neural Sequence Labeling | May 25, 2021 | Language Modeling | Code Available | 0 |