| Comparison of Image Preprocessing Techniques for Vehicle License Plate Recognition Using OCR: Performance and Accuracy Evaluation | Oct 15, 2024 | License Plate RecognitionOptical Character Recognition | —Unverified | 0 |
| MIRAGE: Multimodal Identification and Recognition of Annotations in Indian General Prescriptions | Oct 13, 2024 | Handwriting RecognitionOptical Character Recognition | —Unverified | 0 |
| ChartKG: A Knowledge-Graph-Based Representation for Chart Images | Oct 13, 2024 | Chart Question AnsweringKnowledge Graph Completion | —Unverified | 0 |
| Hespi: A pipeline for automatically detecting information from hebarium specimen sheets | Oct 11, 2024 | Handwritten Text RecognitionHTR | CodeCode Available | 1 |
| JaPOC: Japanese Post-OCR Correction Benchmark using Vouchers | Sep 30, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| See then Tell: Enhancing Key Information Extraction with Vision Grounding | Sep 29, 2024 | Image to textKey Information Extraction | —Unverified | 0 |
| CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials | Sep 27, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Sep 25, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| @Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology | Sep 21, 2024 | BenchmarkingDepth Estimation | —Unverified | 0 |
| Computer Vision Intelligence Test Modeling and Generation: A Case Study on Smart OCR | Sep 14, 2024 | 3D ClassificationOptical Character Recognition | —Unverified | 0 |