| PIXELMOD: Improving Soft Moderation of Visual Misleading Information on Twitter | Jul 30, 2024 | MisinformationOptical Character Recognition | CodeCode Available | 0 |
| ChatSchema: A pipeline of extracting structured information with Large Multimodal Models based on schema | Jul 26, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation | Jul 26, 2024 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| PLayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips | Jul 22, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition | Jul 18, 2024 | DecoderHandwriting Recognition | —Unverified | 0 |
| Task-driven single-image super-resolution reconstruction of document scans | Jul 12, 2024 | Image Super-ResolutionOptical Character Recognition | —Unverified | 0 |
| Toward accessible comics for blind and low vision readers | Jul 11, 2024 | Optical Character RecognitionPrompt Engineering | —Unverified | 0 |
| Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation | Jul 9, 2024 | DecoderImage Generation | CodeCode Available | 0 |
| High-Throughput Phenotyping using Computer Vision and Machine Learning | Jul 8, 2024 | Image SegmentationOptical Character Recognition | CodeCode Available | 0 |
| Optimizing Nepali PDF Extraction: A Comparative Study of Parser and OCR Technologies | Jul 5, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |