| CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset | Jun 6, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement | Jun 3, 2024 | Jersey Number RecognitionMulti-Task Learning | —Unverified | 0 |
| Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities | May 25, 2024 | Boundary DetectionOptical Character Recognition | —Unverified | 0 |
| Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition | May 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Transfer Learning Approach for Railway Technical Map (RTM) Component Identification | May 21, 2024 | Managementobject-detection | —Unverified | 0 |
| GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding | May 6, 2024 | Contrastive Learningdocument understanding | CodeCode Available | 0 |
| DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents | Apr 30, 2024 | 8kDiversity | CodeCode Available | 0 |
| Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism | Apr 29, 2024 | document understandingGPU | CodeCode Available | 0 |
| ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images | Apr 29, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer | Apr 19, 2024 | DecoderOptical Character Recognition | —Unverified | 0 |
| Resilience of Large Language Models for Noisy Instructions | Apr 15, 2024 | Automatic Speech RecognitionOptical Character Recognition | —Unverified | 0 |
| TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model | Apr 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Making Old Kurdish Publications Processable by Augmenting Available Optical Character Recognition Engines | Apr 9, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement | Apr 8, 2024 | BinarizationDocument Enhancement | CodeCode Available | 2 |
| PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents | Mar 23, 2024 | ArticlesOptical Character Recognition | CodeCode Available | 1 |
| Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer Learning | Mar 18, 2024 | Handwritten Digit RecognitionOptical Character Recognition | CodeCode Available | 0 |
| OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System | Mar 18, 2024 | AllDecision Making | —Unverified | 0 |
| Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning | Mar 17, 2024 | Edge DetectionLine Detection | —Unverified | 0 |
| Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering | Mar 14, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking | Mar 13, 2024 | Chinese Spell CheckingIn-Context Learning | —Unverified | 0 |
| LOCR: Location-Guided Transformer for Optical Character Recognition | Mar 4, 2024 | MarketingOptical Character Recognition | —Unverified | 0 |
| Large Language Models for Simultaneous Named Entity Extraction and Spelling Correction | Mar 1, 2024 | DecoderOptical Character Recognition | —Unverified | 0 |
| ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting | Mar 1, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Representing Online Handwriting for Recognition in Large Vision-Language Models | Feb 23, 2024 | Handwriting RecognitionOptical Character Recognition | —Unverified | 0 |
| Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing | Feb 12, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |