| Efficient Video-Based ALPR System Using YOLO and Visual Rhythm | Jan 4, 2025 | License Plate RecognitionOptical Character Recognition | CodeCode Available | 0 |
| Multi-modal Page Stream Segmentation with Convolutional Neural Networks | Sep 27, 2019 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing | Nov 20, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer Learning | Mar 18, 2024 | Handwritten Digit RecognitionOptical Character Recognition | CodeCode Available | 0 |
| MultiOCR-QA: Dataset for Evaluating Robustness of LLMs in Question Answering on Multilingual OCR Texts | Feb 24, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism | Apr 29, 2024 | document understandingGPU | CodeCode Available | 0 |
| MultiQG-TI: Towards Question Generation from Multi-modal Sources | Jul 7, 2023 | Image to textOptical Character Recognition | CodeCode Available | 0 |
| ASTER: An Attentional Scene Text Recognizer with Flexible Rectification | Jun 25, 2018 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters | Jan 1, 2024 | Multi-Task LearningOptical Character Recognition | CodeCode Available | 0 |
| Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis | Jan 8, 2025 | License Plate DetectionLicense Plate Recognition | CodeCode Available | 0 |