SOTAVerified

Optical Character Recognition

Papers

Showing 76100 of 526 papers

TitleStatusHype
MCSCSet: A Specialist-annotated Dataset for Medical-domain Chinese Spelling CorrectionCode1
LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?Code1
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data GenerationCode1
Detection of Furigana Text in ImagesCode1
Combining Morphological and Histogram based Text Line Segmentation in the OCR ContextCode1
Fully Unsupervised Diversity Denoising with Convolutional Variational AutoencodersCode1
DocParser: End-to-end OCR-free Information Extraction from Visually Rich DocumentsCode1
Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documentsCode1
Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image ClassificationCode1
PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific DocumentsCode1
Boosting on the shoulders of giants in quantum device calibrationCode1
Toxicity of the Commons: Curating Open-Source Pre-Training DataCode1
Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven ApproachCode1
It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug ReportsCode0
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document TranscriptionCode0
A Study of Autoregressive Decoders for Multi-Tasking in Computer VisionCode0
ASTER: An Attentional Scene Text Recognizer with Flexible RectificationCode0
A Skip-connected Multi-column Network for Isolated Handwritten Bangla Character and Digit recognitionCode0
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and RecognitionCode0
High-Throughput Phenotyping using Computer Vision and Machine LearningCode0
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character RecognitionCode0
Arrow-Guided VLM: Enhancing Flowchart Understanding via Arrow Direction EncodingCode0
Are VLMs Really BlindCode0
Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer LearningCode0
A model of diffuse Galactic Radio Emission from 10 MHz to 100 GHzCode0
Show:102550
← PrevPage 4 of 22Next →

No leaderboard results yet.