SOTAVerified

Optical Character Recognition

Papers

Showing 101125 of 526 papers

TitleStatusHype
Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text SpottingCode0
Ancient but Digitized: Developing Handwritten Optical Character Recognition for East Syriac Script Through Creating KHAMIS Dataset0
Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese0
Large Language Models for Page Stream Segmentation0
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry AreaCode2
SWIFT:A Scalable lightWeight Infrastructure for Fine-TuningCode11
Revisiting Multi-Modal LLM Evaluation0
Handwritten Code Recognition for Pen-and-Paper CS EducationCode0
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text RetrievalCode1
PIXELMOD: Improving Soft Moderation of Visual Misleading Information on TwitterCode0
Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation0
ChatSchema: A pipeline of extracting structured information with Large Multimodal Models based on schema0
PLayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips0
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition0
Task-driven single-image super-resolution reconstruction of document scans0
Toward accessible comics for blind and low vision readers0
Spanish TrOCR: Leveraging Transfer Learning for Language AdaptationCode0
High-Throughput Phenotyping using Computer Vision and Machine LearningCode0
Optimizing Nepali PDF Extraction: A Comparative Study of Parser and OCR TechnologiesCode0
Mind the Gap: Analyzing Lacunae with Transformer-Based Transcription0
OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst0
M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine TranslationCode0
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded TextCode1
Scaling Automatic Extraction of Pseudocode0
Show:102550
← PrevPage 5 of 22Next →

No leaderboard results yet.