| MultiQG-TI: Towards Question Generation from Multi-modal Sources | Jul 7, 2023 | Image to textOptical Character Recognition | CodeCode Available | 0 |
| T-MARS: Improving Visual Representations by Circumventing Text Feature Learning | Jul 6, 2023 | Optical Character Recognition | CodeCode Available | 1 |
| Resume Information Extraction via Post-OCR Text Processing | Jun 23, 2023 | Object RecognitionOptical Character Recognition | —Unverified | 0 |
| A Survey on Multimodal Large Language Models | Jun 23, 2023 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| Transformer-Based UNet with Multi-Headed Cross-Attention Skip Connections to Eliminate Artifacts in Scanned Documents | Jun 5, 2023 | DenoisingDocument Classification | —Unverified | 0 |
| TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal Domain | Jun 3, 2023 | BenchmarkingDecoder | CodeCode Available | 1 |
| DuoSearch: A Novel Search Engine for Bulgarian Historical Documents | May 30, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Super-Resolution of License Plate Images Using Attention Modules and Sub-Pixel Convolution Layers | May 27, 2023 | Image Super-ResolutionLicense Plate Recognition | CodeCode Available | 1 |
| Exploring Better Text Image Translation with Multimodal Codebook | May 27, 2023 | Machine TranslationOptical Character Recognition | CodeCode Available | 1 |
| Measuring Intersectional Biases in Historical Documents | May 21, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models | May 13, 2023 | Key Information ExtractionNutrition | CodeCode Available | 2 |
| E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation | May 9, 2023 | DecoderMachine Translation | CodeCode Available | 0 |
| Evaluating BERT-based Scientific Relation Classifiers for Scholarly Knowledge Graph Construction on Digital Library Collections | May 3, 2023 | graph constructionOptical Character Recognition | —Unverified | 0 |
| DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents | Apr 24, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Multimodal Short Video Rumor Detection System Based on Contrastive Learning | Apr 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| TransDocs: Optical Character Recognition with word to word translation | Apr 15, 2023 | Deep LearningDocument Translation | CodeCode Available | 0 |
| Linking Representations with Multimodal Contrastive Learning | Apr 7, 2023 | Contrastive LearningOptical Character Recognition | —Unverified | 0 |
| Cleansing Jewel: A Neural Spelling Correction Model Built On Google OCR-ed Tibetan Manuscripts | Apr 7, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Efficient OCR for Building a Diverse Digital History | Apr 5, 2023 | DiversityImage Retrieval | CodeCode Available | 1 |
| A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision | Mar 30, 2023 | DecoderMulti-Task Learning | CodeCode Available | 0 |
| Optical Character Recognition and Transcription of Berber Signs from Images in a Low-Resource Language Amazigh | Mar 21, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset | Mar 9, 2023 | BenchmarkingDeep Learning | CodeCode Available | 0 |
| User-Centric Evaluation of OCR Systems for Kwak'wala | Feb 26, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification | Feb 16, 2023 | Few-Shot Image ClassificationFew-Shot Learning | CodeCode Available | 1 |
| Noisy Parallel Data Alignment | Jan 23, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| On the feasibility of attacking Thai LPR systems with adversarial examples | Jan 13, 2023 | Adversarial AttackLicense Plate Recognition | —Unverified | 0 |
| Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle | Jan 12, 2023 | CPUOptical Character Recognition | —Unverified | 0 |
| Semantic rule Web-based Diagnosis and Treatment of Vector-Borne Diseases using SWRL rules | Jan 8, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling | Jan 6, 2023 | Link PredictionOptical Character Recognition | CodeCode Available | 2 |
| A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition | Dec 27, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Bengali Handwritten Digit Recognition using CNN with Explainable AI | Dec 23, 2022 | Explainable Artificial Intelligence (XAI)Handwritten Digit Recognition | —Unverified | 0 |
| Geometric Rectification of Creased Document Images based on Isometric Mapping | Dec 16, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering | Dec 16, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images | Dec 11, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| PACMAN: a framework for pulse oximeter digit detection and reading in a low-resource setting | Dec 9, 2022 | object-detectionObject Detection | —Unverified | 0 |
| SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels | Dec 5, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Information Retrieval from the Digitized Books | Dec 2, 2022 | Image RetrievalInformation Retrieval | —Unverified | 0 |
| Chart-RCNN: Efficient Line Chart Data Extraction from Camera Images | Nov 25, 2022 | object-detectionObject Detection | —Unverified | 0 |
| Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text Images | Nov 18, 2022 | DeblurringImage Deblurring | CodeCode Available | 1 |
| Text-Aware Dual Routing Network for Visual Question Answering | Nov 17, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Efficient few-shot learning for pixel-precise handwritten document layout analysis | Oct 27, 2022 | Document Layout AnalysisFew-Shot Learning | —Unverified | 0 |
| A Late Multi-Modal Fusion Model for Detecting Hybrid Spam E-mail | Oct 26, 2022 | CPUOptical Character Recognition | —Unverified | 0 |
| MCSCSet: A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction | Oct 21, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| CorDeep and the Sacrobosco Dataset: Detection of Visual Elements in Historical Documents | Oct 15, 2022 | object-detectionObject Detection | —Unverified | 0 |
| MenuAI: Restaurant Food Recommendation System via a Transformer-based Deep Learning Model | Oct 15, 2022 | Food recommendationLearning-To-Rank | —Unverified | 0 |
| Text Detection Forgot About Document OCR | Oct 14, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 2 |
| Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections | Oct 7, 2022 | Key Information ExtractionLine Detection | —Unverified | 0 |
| EraseNet: A Recurrent Residual Network for Supervised Document Cleaning | Oct 3, 2022 | DenoisingOptical Character Recognition | —Unverified | 0 |
| Chandojnanam: A Sanskrit Meter Identification and Utilization System | Sep 29, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| 3D Rendering Framework for Data Augmentation in Optical Character Recognition | Sep 27, 2022 | Data AugmentationOptical Character Recognition | —Unverified | 0 |