SOTAVerified

Optical Character Recognition

Papers

Showing 5175 of 526 papers

TitleStatusHype
hmBERT: Historical Multilingual Language Models for Named Entity RecognitionCode1
Operationalizing a National Digital Library: The Case for a Norwegian Transformer ModelCode1
An Empirical Study of Scaling Law for OCRCode1
PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional NetworksCode1
A Two-Step Approach for Automatic OCR Post-CorrectionCode1
Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video EnvironmentsCode1
Efficient OCR for Building a Diverse Digital HistoryCode1
Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example MiningCode1
GenKIE: Robust Generative Multimodal Document Key Information ExtractionCode1
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth EvaluationCode1
Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven ApproachCode1
Exploring Better Text Image Translation with Multimodal CodebookCode1
Combining Morphological and Histogram based Text Line Segmentation in the OCR ContextCode1
bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali DocumentsCode1
Fully Unsupervised Diversity Denoising with Convolutional Variational AutoencodersCode1
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text RetrievalCode1
Geometry Restoration and Dewarping of Camera-Captured Document ImagesCode1
Implicit Feature Alignment: Learn to Convert Text Recognizer to Text SpotterCode1
Hespi: A pipeline for automatically detecting information from hebarium specimen sheetsCode1
Boosting on the shoulders of giants in quantum device calibrationCode1
A Comprehensive Gold Standard and Benchmark for Comics Text Detection and RecognitionCode1
Iranis: A Large-scale Dataset of Farsi License Plate CharactersCode1
BankNote-Net: Open dataset for assistive universal currency recognitionCode1
A Large Multi-Target Dataset of Common Bengali Handwritten GraphemesCode1
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data GenerationCode1
Show:102550
← PrevPage 3 of 22Next →

No leaderboard results yet.