| MultiOCR-QA: Dataset for Evaluating Robustness of LLMs in Question Answering on Multilingual OCR Texts | Feb 24, 2025 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 |
| KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding | Feb 20, 2025 | Document Understanding, Optical Character Recognition | Unverified | 0 |
| Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models | Feb 18, 2025 | Image to Text, Optical Character Recognition | Code Available | 0 |
| Visual Graph Question Answering with ASP and LLMs for Language Parsing | Feb 13, 2025 | Graph Question Answering, Optical Character Recognition | Unverified | 0 |
| Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments | Feb 10, 2025 | Benchmarking, Optical Character Recognition | Code Available | 1 |
| Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents | Feb 6, 2025 | Image Captioning, Optical Character Recognition | Unverified | 0 |
| LoCoML: A Framework for Real-World ML Inference Pipelines | Jan 24, 2025 | Automatic Speech Recognition, Machine Translation | Unverified | 0 |
| Exploring AI-based System Design for Pixel-level Protected Health Information Detection in Medical Images | Jan 16, 2025 | De-identification, Optical Character Recognition | Unverified | 0 |
| Comparative analysis of optical character recognition methods for Sámi texts from the National Library of Norway | Jan 13, 2025 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 |
| Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis | Jan 8, 2025 | License Plate Detection, License Plate Recognition | Code Available | 0 |