| VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text | Jun 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset | Jun 6, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images | Apr 29, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents | Mar 23, 2024 | ArticlesOptical Character Recognition | CodeCode Available | 1 |
| ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting | Mar 1, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| An Empirical Study of Scaling Law for OCR | Dec 29, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Data Generation for Post-OCR correction of Cyrillic handwriting | Nov 27, 2023 | Handwriting generationHandwritten Text Recognition | CodeCode Available | 1 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Oct 25, 2023 | Handwritten Text RecognitionKey Information Extraction | CodeCode Available | 1 |
| GenKIE: Robust Generative Multimodal Document Key Information Extraction | Oct 24, 2023 | DecoderKey Information Extraction | CodeCode Available | 1 |
| Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks | Oct 8, 2023 | BinarizationCPU | CodeCode Available | 1 |